C.Lammerzahl C.W.F.Everitt F.W.Hehl (Eds.) 



Gyros, Clocks, 
Interferometers...: 
Testing Relativistic 
Gravity in Space 





Lecture Notes in Physics 



Editorial Board 

R. Beig, Wien, Austria 

J. Ehlers, Potsdam, Germany 
U. Frisch, Nice, France 

K. Hepp, Zurich, Switzerland 

W. Hillebrandt, Garching, Germany 
D. Imboden, Zurich, Switzerland 
R. L. Jaffe, Gambridge, MA, USA 
R. Kippenhahn, Gottingen, Germany 
R. Lipowsky, Golm, Germany 

H. V. Lohneysen, Karlsruhe, Germany 

I. Ojima, Kyoto, Japan 

H. A. Weidenmiiller, Heidelberg, Germany 

J. Wess, Miinchen, Germany 
J. Zittartz, Koln, Germany 



Springer 

Berlin 

Heidelberg 

New York 

Barcelona 

Hong Kong 

London 

Milan 

Paris 

Singapore 

Tokyo 



Physics and Astronomy 



ONLINE LIBRARY 



http://www.springer.de/phys/ 




Editorial Policy 

The series Lecture Notes in Physics (LNP), founded in 1969, reports new developments in 
physics research and teaching — quickly, informally but with a high quality. Manuscripts 
to be considered for publication are topical volumes consisting of a limited number of 
contributions, carefully edited and closely related to each other. Each contribution should 
contain at least partly original and previously unpublished material, be written in a clear, 
pedagogical style and aimed at a broader readership, especially graduate students and 
nonspecialist researchers wishing to familiarize themselves with the topic concerned. For 
this reason, traditional proceedings cannot be considered for this series though volumes 
to appear in this series are often based on material presented at conferences, workshops 
and schools (in exceptional cases the original papers and/or those not included in the 
printed book may be added on an accompanying CD ROM, together with the abstracts 
of posters and other material suitable for publication, e.g. large tables, colour pictures, 
program codes, etc.). 

Acceptance 

A project can only be accepted tentatively for publication, by both the editorial board and the 
publisher, following thorough examination of the material submitted. The book proposal 
sent to the publisher should consist at least of a preliminary table of contents outlining the 
structure of the book together with abstracts of all contributions to be included. 

Final acceptance is issued by the series editor in charge, in consultation with the publisher, 
only after receiving the complete manuscript. Final acceptance, possibly requiring minor 
corrections, usually follows the tentative acceptance unless the final manuscript differs 
significantly from expectations (project outline). In particular, the series editors are entitled 
to reject individual contributions if they do not meet the high quality standards of this 
series. The final manuscript must be camera-ready, and should include both an informative 
introduction and a sufficiently detailed subject index. 

Contractual Aspects 

Publication in LNP is free of charge. There is no formal contract, no royalties are paid, 
and no bulk orders are required, although special discounts are offered in this case. The 
volume editors receive jointly 30 free copies for their personal use and are entitled, as are the 
contributing authors, to purchase Springer books at a reduced rate. The publisher secures 
the copyright for each volume. As a rule, no reprints of individual contributions can be 
supplied. 



Manuscript Submission 

The manuscript in its final and approved version must be submitted in camera-ready form. 
The corresponding electronic source files are also required for the production process, in 
particular the online version. Technical assistance in compiling the final manuscript can be 
provided by the publisher’s production editor(s), especially with regard to the publisher’s 
own Latex macro package which has been specially designed for this series. 

Online Version/ LNP Homepage 

LNP homepage (list of available titles, aims and scope, editorial contacts etc.): 
http://www.springer.de/phys/books/lnpp/ 

LNP online (abstracts, full-texts, subscriptions etc.): 
http://link.springer.de/series/lnpp/ 




C. Lammerzahl C. W. R Everitt R W. Hehl (Eds.) 



Gyros, Clocks, 
Interferometers . . . : 
Testing Relativistic 
Gravity in Space 




Editors 



Claus Lammerzahl 
Institute for Experimental Physics 
Heinrich-Heine-University Diisseldorf 
40225 Diisseldorf, German 

C. W. Francis Everitt 

W.W. Hansen Experimental Physics Laboratory 
Stanford University 
Stanford, CA 94305, USA 

Friedrich W. Hehl 
Institute for Theoretical Physics 
University of Cologne 
50923 Koln, Germany 



Cover picture: see contribution by N. Lockerbie in this volume (Fig. 1, modified). 

Library of Congress Cataloging-in-Publication Data applied for. 

Die Deutsche Bibliothek - CIP-Einheitsaufnahme 

Gyros, clocks, interferometers ... : testing relativistic gravity in 
space / C. Lammerzahl ... (ed.). - Berlin ; Heidelberg ; New York ; 

Barcelona ; Hong Kong ; London ; Milan ; Paris ; Singapore ; Tokyo : 

Springer, 2001 

(Lecture notes in physics ; Vol. 562) 

(Physics and astronomy online library) 

ISBN 3-540-41236-0 

ISSN 0075-8450 

ISBN 3-540-41236-0 Springer- Verlag Berlin Heidelberg New York 

This work is subject to copyright. All rights are reserved, whether the whole or part of the 
material is concerned, specifically the rights of translation, reprinting, reuse of illustra- 
tions, recitation, broadcasting, reproduction on microfilm or in any other way, and 
storage in data banks. Duplication of this publication or parts thereof is permitted only 
under the provisions of the German Copyright Law of September 9, 1965, in its current 
version, and permission for use must always he obtained from Springer- Verlag. Violations 
are liable for prosecution under the German Copyright Law. Springer- Verlag Berlin Hei- 
delberg New York 

a member of BertelsmannSpringer Science-l-Business Media GmbH @ Springer-Verlag 
Berlin Heidelberg 2001 

Printed in Germany The use of general descriptive names, registered names, trademarks, 
etc. in this publication does not imply, even in the absence of a specific statement, that such 
names are exempt from the relevant protective laws and regulations and therefore free for 
general use. 

Typesetting: Camera-ready by the authors/editors 

Camera-data conversion by Steingraeber Satztechnik GmbH Heidelberg 

Cover design: design & production, Heidelberg 

Printed on acid-free paper 

SPIN: 10786632 55/3141/du - 5 4 3 2 1 0 





Preface 



Nowadays, Experimental Gravitation is one of the exciting areas in modern 
physics. Many important experiments have entered their final state of realization: 
In 2001 the interferometric gravitational wave detectors will start to collect data, 
in May 2002 Gravity Probe B (Stanford orbiting gyroscope experiment) will be 
launched, and, in 2005, STEP (the Satellite Test of the Equivalence Principle) 
is planned to be put in orbit. All these experiments will give new momentum 
to the field of experimental gravitation and, of course, to the development of 
theoretical concepts in gravity. 

Besides this, the development of many new devices for carrying through high- 
precision measurements will open up new areas for testing the gravitational in- 
teraction. New atomic clocks, such as the atomic fountain clock, for example, 
or clocks which will be based on Bose-Einstein condensates, will lead to an 
increased precision, giving better results of tests of, for example, the gravita- 
tional red shift, or will enable the measurement of gravitomagnetic effects for 
clocks. Also, new quantum devices like atom interferometers or interferometers 
with Bose-Einstein condensates will give much better results while probing the 
gravitational field. And, last but not least, clocks will have an application in 
establishing a global reference system which is used for the GPS (Global Posi- 
tioning System) and for telecommunication. 

From the theoretical point of view all these experiments are highly interest- 
ing and the new results will stimulate the effort of unifying the four interactions 
and/or finding a consistent combination of gravitation theory and quantum the- 
ory, i.e., a quantum gravity theory. It is a general feature of such generalized 
theories of gravity that extra scalar interactions emerge accompanying the grav- 
itational interaction. As one consequence they lead to a violation of the Equiva- 
lence Principle. Moreover, anomalous spin interactions may arise in that context. 
Therefore, tests of gravity may be at the same time tests of our microscopic view 
of the world. 

Experiments are designed for testing the predictions of a theory as well as for 
testing the foundations of theories. In these proceedings both aspects are repre- 
sented. The Lense-Thirring effect and gravitational waves are two of the main 
predictions of General Relativity which have not yet been confirmed directly or 
with convincing precision. As far as the foundations of General Relativity are 
concerned, we deal with the test of the Equivalence Principle which is at the very 
heart of the geometric nature of General Relativity. This principle is character- 
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istic for this kind of interaction. Other topics in this volume are experiments 
with clocks that measure the universality of the gravitational red shift. A vio- 
lation of this universality implies the existence of more than one gravitational 
interaction. In addition, since the most precise clocks are atomic clocks, this 
class of experiments also yields information about the action of gravity in the 
quantum domain. Therefore, the application of new quantum-based devices, like 
atomic interferometers, has been examined for measurements of gravitational in- 
teraction. These and other devices seem to be very promising for increasing the 
experimental precision considerably. Thus, the effects of gravitation on quantum 
systems with spin are part of these proceedings. All these attempts will provide 
a better experimental foundation for the theory of gravity. 

Accordingly, our meeting about Gyros, Clocks, Interferometers, : Testing 
General Relativity in Space held from 21 to 27 August 1999 in Bad Honnef and 
these proceedings are devoted to these topics: 

• The Lense-Thirring effect 

• The detection of gravitational waves 

• Testing the Equivalence Principle 

• Clocks and rods in gravitational fields 

• Quantum tests of gravity 

• Electromagnetic field and gravity 

We want to present a review of the status of experimental gravity at the 
beginning of the 21st century. In doing so, we first tried to give a theoretical 
understanding of the various effects followed by reports about the status of 
the various experimental projects. We got the impression that this combination 
of theoretical and experimental talks gave a more complete picture of what 
is going on and contributed to the mutual understanding of theoreticians and 
experimentalists. 

Finally, we would like to express our gratefulness to the WE-Heraeus Founda- 
tion for financing this meeting and the Deutsche Forschungsgemeinschaft (DEG) 
for giving support to many of our colleagues from Eastern Europe. 



Konstanz and Diisseldorf, Stanford, Cologne 
October 2000 



Glaus Ldmmerzahl 
Francis Everitt 
Friedrich W. Hehl 
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An Overview of Solar System Gravitational 
Physics: The Theory— Experiment Interface 

Kenneth Nordtvedt 

Northwest Analysis 

118 Sourdough Ridge, Bozeman, MT 59715, USA 



Abstract. If the gravitational metric tensor field of Einstein’s General Relativity is 
supplemented by other long range, very weakly coupled interaction fields, then Gen- 
eral Relativity’s Equivalence Principle foundations are violated, or its post-Newtonian 
(1/c^ order) structure is altered, or both. Space experiments test for and measure such 
possibilities; presently universality of free fall is confirmed to about a part in 10^^, and 
no deviations of post-Newtonian metric gravity from general relativity are seen down 
to the few parts in 10'* level. Future experiments in space can significantly increase 
the precision to which fundamental physical law is probed. In particular, transponded 
interplanetary laser ranging can measure presence of metrically coupled scalar fields in 
gravity with two or three orders of magnitude higher precision than past experiments, 
and can begin to measure the second post-Newtonian (1/c*) structure of gravity. A 
space-based experiment of the universality of free-fall (STEP) can detect additional 
interactions of a non-metric nature with five or six orders of magnitude higher precision 
than today’s experiments. 



1 Introduction 

In General Relativity theory the gravitational interaction between matter is 
transmitted via the ten components of a symmetric tensor field While 

the space and time gradients of these ten potentials determine the local gravi- 
tational equations of motion of matter, the comparative values of the potentials 
between different locations also establish global relationships between the rates 
of clocks and spans of rulers which are substantially separated from each other. 
This creates, not simply deforms, the metrical properties of space and time — 
the cosmic arena — which are not a priori or by default Euclidean, but rather are 
dynamical and dependent on the distribution of matter throughout the cosmos 
and the gravitational field for their existence [1]. This historical progression in 
our understanding of the the cosmic arena’s nature — from Newton’s separate 
and rigid absolute time and absolute Euclidean space, to Minkowski’s united, but 
still globally rig id space-time arena, to the metric gravity field’s creation of a 
dynamic arena shaped by its material content — is illustrated symbolically by 
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Various experiments in space have contributed to the systematic mapping 
out of the long range gravitational interaction in the presence of source mat- 
ter distributions of ever-increasing generality and complexity. The experimental 
goals for the future are not only to continue improving the precision of this map- 
ping, but also to deliberately search for any additional long range force fields 
which may supplement Einstein’s tensor gravity by acting between bodies and 
thereby alter the details of the total in teraction. 

The major achievements of physics in the 20th century depend not only on the 
two major revolutions of relativity and quantum physics and their applications, 
but also on the elimination of action-at-a-distance from physical law and its 
replacement with the field theory of interaction — that matter at one location 
acts as source of certain dynamical fields, which once produced then propagate in 
space and time, and then interact with other matter located elsewhere. Maxwell’s 
equations for the electromagnetic field in presence of sources, plus the force law 
of Lorentz which gives the response of charges (sources) to those fields, formed 
the breakthrough model for this paradigm over a century ago 
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and includes all the basic ingredients for this new conceptual framework. A 
compact way to formulate the electromagnetic interaction is in terms of a vector 
field of potentials A^(r, t), p = 0,x,y, z 
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with indices i,j, k ranging over the three spatial dimensions, and being the 
anti-symmetric permutation matrix. Today’s so-called Standard Model for the 
electromagnetic, weak, and strong nuclear forces in matter is built upon the re- 
peated use of vector fields of interaction, with there being three additional fields 
for the weak force, IT)) , and eight ’gluon’ fields for the strong force, Gy, . These 
vector fields have been successfully Integra ted into quantum physics (involving 
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the careful handling of still not fully understood renormalizable infinities), but 
the metric gravitational field of General Relativity has not been so integrated, 
which is one hint that something is still missing in our theory of gravity. (It 
should be noted, however, that the post-Newtonian ’near fields’ of gravity which 
accompany source bodies contribute to matter’s quantum mechanical Hamilto- 
nian; otherwise body center-of-mass wave packets would not follow the post 
-Newtonian classical general relativistic trajectories which we observe in various 
experiments.) 

The simplest type of field, the scalar field, presently plays no role in modeling 
observed phenomena, yet theoretical speculations beyond the Standard Model 
suggest it should be a natural partner of the metric tensor gravitational field. 
A scalar field can, in fact, combine with the tensor field to jointly produce a 
hybrid metric field for the space-time arena. Because of an interesting equality 
between static-limit tensor and scalar source strengths for bodies of negligible 
external world stresses, gravi tational self-energy, and deviation from internal 
equilibrium. 



J T^dPx = j Td^x (4) 

is matter’s stress-energy tensor, T is its trace), such bodies fulfill compos- 
ition-independence of free fall rates and clock rate shifts in presence of other 
bodies. Presence of the scalar field in this case only manifests itself in post- 
Newtonian phenomena. The search for a ’metric’ scalar field in gravity has 
therefore been, and continues to be, one of the main themes of experimental 
gravity in space. 

A scalar field can not only alter the phenomena of metric gravity if coupled 
to matter in the special manner indicated above, this field also has the possibility 
(indeed the probability, according to string theorists!) of coupling differently or 
selectively to the various (electromagnetic, weak, or nuclear) scalar attributes of 
matter, and consequently violating Einstein’s Equivalence Principle by produc- 
ing composition-dependence of both gravitational free fall rates and clock rate 
shifts in proximity to m atter (gravitational ’red-shift’). 

Other very weakly coupled, long range force fields may also be present, wait- 
ing for discovery in more precise space experiments. For instance, a vector field 
Bfj, coupled to some combination of baryon and lepton number would also gen- 
erate composition-dependent accelerations of bodies. 

From what has been experimentally confirmed to date, it is not likely we’ll 
find a breakdown (in long range phenomena) of Einstein’s general relativistic 
equations for the dominant tensor field gravitational interaction 

GO = (5) 

but this still leaves open the question of whether there are any supplementary 
long range and very weak interactions between bodies? 
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2 Testable Models for Experiments 



The ingredients for modeling the space experiments consist primarily of the 
equations of propagation of electromagnetic signals, the rate equations of clocks, 
and the equations of motion for bodies. The totality of all the long range interac- 
tions in physical law will contribute to one or more of these dynamical equations. 
Clocks and bodies exist with a wide variety of internal compositions and struc- 
ture. This affects how they can couple to external force fields, and our models 
must take this into account. Some useful representations of these key dynamical 
equations are as follows: 



c(r, t) = Coo ( 1 - (1 -k 7 ) 

^ _ I , Iv^ u(r,t) 

dTc - I 1 2 c2 c2 
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(7) 



in which U{r,t) is the Newtonian potential function of proximate matter 
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The gravitational clock rate shift parameter ac has the universal, composition- 
independent value of one in metric theories, but more generally it may depend 
on the nature of the clock if other interaction fields supplement the tensor metric 
field of general relativity. The Eddington parameter 7 which appears in the light 
propagation equation equals one in general relativity but has smaller values 
in scalar-tensor metric gravity, for example. The light equat ion leads to two 
measurable phenomena: the deflection of light’s direction of travel through angle 

50 = 2(1 + 7)^ (9) 

when making a complete passage by a central body M with distance of closest 
approach D, and a delay in integrated time of propagation 



St = {1 + 7 ) 







( 10 ) 



when a signal travels between two sites at distances i?i and R 2 from the central 
body and passes at distance of closest approach D from that body. In the last 
year or so, analysis of very long baseline interferometry (VLBI) observations of 
deflections of light from very many astronomical sources has produced the most 
precise measurement of the light propagation equation, 

|7-1|VLBI < 6x 10-4 (2a), (11) 



whereas the time delay effect had for the previous couple decades supplied the 
premier experimental data for this measurement. 
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Accepting that local gravitational physics is Lorentz-invariant and Lagrange- 
ian~based with energy, momentum, and angular momentum conservation laws, 
foundations empirically established by the very space experiments under discus- 
sion in this report, the gravitational equations of motion for the general A^-body 
solar system, valid not only in Einstein’s pure tensor gravity, but also generalized 
to represent plausible alternative metric theory possibilities by introduction of 
the two Eddington para meters 7 and (3, is 



A 

B 
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with i,j,k = l,...,N,Vi = dvi/dt, ai = dvi/dt, and nj = \ri — rj\. A factor 
1/c^ multiplies all the post-Newtonian lines B through F, but has been set 
equal to 1 to simplify presentation. This total equation is very close to Eq. (1) 
of reference [4]; I have simply organized the terms of the equation of motion 
by type on separate named lines A, . . . ,F for purposes of discussion (see page 
321). And on line F, for purposes of stressing the inertial nature of these forces, 
I have retained the proportionality of these forces to the body acceleration 
which in reference [4] are set equal to their Newtonian values. Hj = GMj are the 
body gravitational mass parameters, and the Newtonian acceleration functions 
are indicated 

9ij = ^ '"'ji ■ (13) 

ij 

Setting 7 = /3 = 1, all body factors M{G)/M{I) = 1, and G equal to a con- 
stant as in general relativity, this becomes the equation of motion which JPL’s 
computer ephemeris uses to integrate and produce the default orbits of the planets 
and satellites in the solar system. Deviations from this nominal general relativis- 
tic case then produce experimental signals proportional to scientific parameters 
7 — 1, /3 — 1, M{G)/M{I) — 1, dG/Gdt, etc., which can then be measured in 
various experim ents by least squares fit type data analysis techniques. 
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Two necessary modifications of the Newtonian acceleration appear in line 
A of Eq.(12). In alternative theories of gravity the local Newtonian coupling 
strength G will generally be changing in some proportion to the universe’s Hub- 
ble expansion rate JI [ 2 ] 

G(t)^Go + G (t-to) + ... , with G/G ^ (4/3-3- j)H. (14) 



In such theories the necessary and consistent application of the total equation 
of motion to the internal gravitational structures of the bodies and their inter- 
actions with the gravity of the external world also alters the gravitational to 
inertial mass ratio of each celestial body in proportion to its internal gravita- 
tional binding energy [3] : 



f M(G) \ 



= 1 - (4/3 - 3 - 7) 



1 



2M,c2 



Gp(xi)p(y,) 

\xt - Vi\ 



d Xi d yi . 



(15) 



Celestial bodies, in this general situation, will therefore not fall in gravitational 
fields at test body rates. Measuring this ratio for the Earth by fitting the lunar 
laser ranging (LLR) data has been one of the key methods for measuring the 
metric theory, Eddington coupling parameters 7 and (especially) j3 which both 
take values different from their general relativity values of one in scalar-tensor 
theories [4,5]. 

Different terms of the general equation of motion given by Eq.(12), either 
individually or collectively, play roles in determining the different so-called ‘ef- 
fects’ of post-Newtonian gravity — perihelion precession, deSitter precession, 
Lense-Thirring precession, etc. But it must be stressed that all effects must 
be calculable from a common underlying equation of motion valid for the gen- 
eral case. The equation of motion can not be altered to ‘fit’ one effect without 
considering how that affects the ‘fit’ to other observed effects. 

It is worthwhile to exhibit the static limit force produced by an additional 
long range, very weakly coupled force 

F, = if, V, ^ ^ (16) 



The coupling strengths of this interaction, Ki, generally will not be proportional 
to the mass-energy contents of the interacting bodies, causing violations of key 
predictions of the Equivalence Principle. For example, two bodies A, B will now 
accelerate at different rates toward another source body S located at distance 
R away 



\aA — osj 



\9s\ 



Ks 

GMs \Ma 



Mb) 



(1 + yR) e-f^^ . 



(17) 



And atomic clocks whose frequencies are determined by the energy differences 
between two quantum states m, n, 



d3n 

2nh 



(18) 
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will now generally acquire anomalous, clock-dependent frequency shifts when in 
proximity to a source body, thereby modifying the universal rate shifts predicted 
by the Equivalence Principle and metric theories of gravity 

Sfnm ^ _ GMs f _ 
fnm C^R V 

These expressions for the two types of Equivalence Principle violations al- 
low comparisons to be made between experiments involving clocks and those 
measuring gravitational free-fall rate differences among assorted materials, with 
the purpose of noting their relative abilities to detect any new interaction with 
coupling strengths Ki. These expressions can also guide choice of experimental 
materials and the design of special clocks in order to maximize experimental 
sensitivities to specific models for ne w interactions. 

Even if the two quantum levels of an atomic clock have common coupling 
strengths = Km, absolute clock rate shift experiments could measure the 
presence of such an additional interaction by comparing the strength of the 
clock shifts now proportional to only GMs to the effective Newtonian coupling 
strength of the same central body with some planet P, which has now become 

Ps = GMs + Ks^ {I + . (20) 

If the additional interaction given by Eq. (16) has finite range, then its con- 
tribution to the periastron precession rate of orbits of size comparable to the 
interaction range l//i provides a very sensitive means for its detection. For or- 
bits of small eccentricity, the extra acceleration term from Eq. (16) generates an 
orbital precession rate of 




u;(orb) 2 ’ 



(21) 



The orbits of satellites and celestial bodies of size from about an Earth radius 
and up to the orbit of Mars have been measured with high precision using laser 
or radar ranging techniques. The absences of any anomalies in the precession 
rates of the various orbits fit to the data give very strong constraints on the 
presence of finite range interactions. These constraints are shown in Fig. 1 by 
the four labeled curves, and they apply for the supplementary interaction being 
either metric or non-metric. The s cientific significance of orbital precession 
rate measurements has not been fully appreciated in the past, and their future 
improvement over the total range of available orbits can be an effective way to 
enhance the search for new interactions. 

The full content of the equation of motion given by Eq. (12) includes the 
grouped contributions to the equation given in lines marked A through F. A 
rich variety of post-Newtonian phenomena results, including 



• rescaling of the inertia of a body or its mass elements in proportion to the 
strength of the gravitational potential produced by the other bodies and 
mass elements of the system (line F), 




10 



K. Nordtvedt 



• accelerations of a body due to motion of either the acted-upon body — vf 
— or the acting body — v'^ (line D), 

• non-linear accelerations proportional to the second power of source body 
masses or to the product of source (acting) bodies and acted-upon body 
masses (line B), 

• gravito-magnetic accelerations proportional to the velocities of the acted- 
upon as well as the acting bodies — ViVj (line C), 

• inductive accelerations of a body i in proportion to the acceleration aj of 
other nearby bodies j (line E). 

From the entire collection of experimental observations, these different parts 
of the total equation of motion are all presently confirmed and measured to bet- 
ter than a part in 10 ^ precision, and as well the local Lorentz invariance and 
conservation laws foundations are empirically established to even higher preci- 
sion. The LLR experiment has proven to be a particularly comprehensive probe 
of the equation of motion; there are measured lunar orbit perturbations affected 
by every part of the general equation (A , ... , F). By contrast, the initial historic 
success of general relativity to explain the anomalous precession of perihelion for 
Mercury’s orbit results from the special case of the general Eq. ( 12 ) in which a 
single test body (Mercury) moves in the static, spherically symmetric gravity of 
the Sun 



, 1 2 

1 + 2 ^ +( 27 + 1 )^^) 



a + V {v ■ a) 



( 22 ) 



= ( 1 - (2/3 - 1) + (2y -f l)v^ ) g-{2j+l)v{v-g) 



(here again, 1/c^ has been set equal to one in the post-Newtonian terms), and 



g = 



GM 



r . 



(23) 



3 Lunar Laser Ranging 

Modeling of the LLR experiment, which includes a moving and accelerating 
Moon orbitting a moving and accelerating Earth, both bodies in the combined 
gravitational field of the Sun, of each other, and of themselves, results in LLR be- 
ing a comprehensive laboratory for testing the total structure of the gravitational 
equation of motion for bodies (LLR also tests the more general parameterized 
metric field equation of motion developed by Will and Nordtvedt [8,9]). The 
measured Earth-Moon range is sensitive to literally every one of the phenomena 
in the body equation of motion described above. From the fit of the range data, 
several types of terms in the general relativistic equation of motion are today 
confirmed with precisions of at least a few parts in 10"^. But there are three LLR 
measurements in gravitational physics which are of particular scientific interest. 
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1. Earth and Moon are found to fall toward the Sun at equal rates to three 
parts in 10^^ precision. This confirms both the non-linear self-coupling of 
gravity to itself, 

1/3 - 1 Illr < 2x 10-4 (24) 

and the composition-independence of gravity’s coupling to matter (Earth 
and Moon have somewhat different average chemical compositions). 

2. Inertial space in the locality of the Earth-Moon system is found to rotate 
relative to distant inertial space at the rate predicted in General Relativity 
— deSitter’s geodetic precession — to precision of three or four parts in a 
thousand. 

3. Newton’s G shows no time variation at the few parts in 10^^ per year level. 

For more detail on the LLR experiment and theory, see the other contribution 
to these proceedings. 

4 Geodetic Precession and NASA’s Gravity Probe— B 

When a system moves non-radially at velocity V through the gravitational field 
of an external body M, the accelerations of the system’s individual bodies 
moving in that system at the additional velocities Vi experience Coriolis-like 
acceleration terms equal to 

<ia, = 2 ^ (HxV)x^,. (25) 

This indicates that the local inertial frame moving with that system rotates 
with respect to the distant inertial frame at the deSitter geodetic precession rate 
(about 19.2 mas/year) 

«dS= (7+i)^Hx V, (26) 

LLR presently measures this rotation of local inertial space to good precision 
through its .07 mas/year fit of the perigee precession of the lunar orbit. NASA’s 
GP-B mission of Earth-orbitting gyroscopes, scheduled to be launched early 
2002, is designed to measure this precession due to motion through Earth’s 
gravity with two orders of magnitude more precision; if successful this will yield 
a very significant improvement in our knowledge of the fundamental metric field 
Eddington p arameter 7 which enters Eq.(26). 

5 Interplanetary Laser Ranging 

Two decades ago NASA’s Viking mission to Mars included transponded radar 
ranging to both a spacecraft orbitting that planet and a lander. From the indi- 
vidual range measurements of about 7 meters precision, and relying especially on 
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those observations made when the ranging signals passed close to the Sun, the 
propagation time delays given by Eq. 8 were fitted, and the Eddington 7 param- 
eter was measured to precision of 2 x 10“^. Transponded laser ranging between 
Earth and Mars co uld be performed in a future mission with precision compara- 
ble to that of LLR — of order centimeters. By fitting the laser pulse propagation 
times, and also taking into account the perturbations of the Mars and Earth or- 
bits which emerge from the relativistic equation of motion, both parameters 7 
and f3 can be measured to a part in 10®. For this purpose, a key perturbation 
results from the 4-body problem involving Sun, Jupiter, Earth and Mars. If the 
Sun accelerates anomalously toward Jup iter because its gravitational to inertial 
mass ratio differs from one at either the first or second post-Newtonian order (y 
is a collection of second post-Newtonian order metric field expansion coefficients 
which sum to zero in general relativity). 



M{G) 

M{I) 



^ ^ '’2Mc^ J \x-x'\ 

^ r p{x)p{x')p{x") 

J \x-x'\\x-x"\ 

^ 1 - 4 X 10 "® (4/3 - 3 - 7 ) -k 7 X 10-11 y , 



(27) 



then the orbit of an inner planet is polarized toward Jupiter by amount 



( M{G) A Mj 3f?p - Gj flj 

Y j g Mg 2f2p — f2j Qp — [}j 



(28) 



in which fip and Qg refer to the orbital frequencies of the planet and Jupiter, 
respectively, and Rj is Jupiter’s orbital radius. The Earth and Mars polarizations 
then produce a unique time-dependent Earth-Mars range signal. As seen in 
Eq.(28), replacing Mars by a more distant asteroid as the ranging target would 
enhance the effect and should be considered as a mission option. Laser ranging 
to a space-based gravity wave observatory like LISA will also show this orbit al 
perturbation. Although this may be the only relativistic effect in the solar system 
involving second post-Newtonian order (l/c"i) features of the gravitational field 
which can be measured in the near future, theoretical work suggests that the 
precision measurement of the first post-Newtonian parameters 7 and /3 which 
will result is of more importance than the novel but low precision measurement of 
the second post-Newtonian order parameters. An experiment at this level could 
be a decisive type of solar system experiment for probing metric gravity. It has 
been shown that scalar-tensor metric gravity, which during the early universe 
may have contained comparable coupling strengths of these two interactions with 
matter, would within some general scenarios find the cosmic background scalar 
field dynamically evolving as the universe expands toward a field value where the 
scalar field’s coupling to matter measured by 1 — 7 and 1— /3 has naturally become 
very small, but not exactly zero, in our era [ 6 ]. (A similar scenario has been 
studied in non-metrically coupled scalar-tensor theories, giving a motivation 
for testing, for example, the composition-independence of gravitational free fall 
beyond the present-day part in 10^^ confirmation [7].) 
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Constraints On ‘New’ Force Strength 




Yukawa Range - l//i - km 



Fig. 1. The solid cnrves marked “LAGEOS”, “LLR”, “Mercury”, and “Mars” show 
constraints on the strength of any metric or non-metric force which may snpplement 
general relativistic gravity, plotted as fnnction of Ynkawa range. These constraints re- 
snlt from measnrements of the periastron precession rates of the corresponding orbits. 
The LLR cnrve is double humped, reflecting the roles of both Earth and Sun in de- 
termining the lunar orbit. The dashed curves marked “LLR-EP” and “L LR (Metric 
EP)” are constraints from the measnrement of the equality of the Earth’s and Moon’s 
acceleration rates toward the Sun. This result has separate interpretations for the cases 
of the supplementary force being metric or non-metric, and these constraints extend to 
‘inhnite’ range, i.e. inverse-sqnare. The dashed curve marked ’’STEP” shows the level 
of very weak non-metric force which can be detected by that futnre space experiment. 



6 A STEP Experiment 



From laboratory experiments and lunar laser ranging, we presently know that 
differently composed bodies accelerate in a gravitational field at rates identical 
to about a part in 10^^. From Eq.(12), and noting that various attributes per 
unit mass of atoms Ki/Mi fractionally differ by about a part in 10^ as one selects 
from different regions of the periodic table of elements (baryon number, nuclear 
electrostatic energy, etc.), it can be concluded that most any supplementary long 
range interaction must be at least a factor of about 10® weaker than the strength 
of gravity (for certain types of coupling of the supplementary field, the constraint 
would be stronger) 






GMiMj 



< 10 "®. 



(29) 
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This experimental constraint is shown by one of the curves in Fig, 1. Exploit- 
ing several favorable features of the experimental environment in an orbitting 
spacecraft, a STEP (Satellite Test of the Equivalence Principle) mission is being 
designed to measure differences in the gravitational free fall rates of a variety 
of elements toward Earth with precision of a part in 10^®. This will extend 
the search for new long range interactions by five or six orders of magnitude 
higher precision for interaction ranges from an earth radius up to infinity (in- 
verse square). This STEP goal is shown by another curve in Fig.l. Present-day 
investigations of unified theories beyond the Standard Model, such as string 
theories, suggest that such additional interactions may exist, especially a scalar 
field coupled selectively to the different scalar attributes of matter. A STEP-like 
experiment is one of the very few ways we may be able to search for and possi- 
bly find consequences of these under lying theories in our ‘low energy’ realms of 
experience. Even a discovery by STEP of no composition-dependent accelera- 
tions at these incredible levels of precision would place strong constraints on and 
guidance for the theoretical investigations working toward unification of physical 
law. 
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Abstract. The Newtonian gravitational constant G was the hrst known fundamental 
constant of physics. Nevertheless, the measurement of its value still seem to be in 
a rather sad shape. Recently, the CODATA Task Group on Fundamental Constants 
recommended a preliminary value of G with a relative uncertainty of 0.15%. This is 
more than ten times larger as the previous recommendation! 

In the first part of this lecture, a brief summary is given of recent experimental 
efforts to determine G. The second part is a description of our experiment at the 
University of Zurich. 

We use a beam balance to measure the gravitational forces of large field masses 
(13.5 X 10® kg mercury) on 1kg test masses. A first result with an uncertainty of 
220 X 10~® has been published recently. Presently we are working to improve the 
experiment. 



1 Introduction 

Henry Cavendish published in 1798 - more than 200 years ago - his famous 
paper titled “Experiments to determine the density of the earth” [1]. This ex- 
periment is the first laboratory experiment to determine the Newtonian gravi- 
tational constant G. Cavendish employed a torsion balance, a device which, up 
to now, is widely used to measure G. Although the measurement techniques im- 
proved considerably during the last 200 years, CODATA® in 1998 recommended 
an uncertainty of 1500 ppm and a value of G = 6.673(10) x 10“®® m®kg“®s“^ 
[2,3]. 

Compared to the previous CODATA recommendation of 1986 [4], the uncer- 
tainty was increased by more than a factor of ten. The reason for this surprising 
step can be seen in Fig.l, showing the results of recent measurements. There 
are large and apparently not well understood discrepancies. The CODATA task 
group on Fundamental Constants decided to recommend the old 1996 value of 
G, but to drastically enlarge the recommended uncertainty. 

On the positive side, these discrepancies provided motivation for many new 
experiments. Several of these new results were presented at a conference held in 
London 1998 to celebrate the 200th anniversary of the Cavendish experiment. 

® The Committee on Data for Science and Technology of the International Council of 
Scientific Unions. 
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Fig. 1. Latest results of published G experiments and the recommended CODATA 
values of 1986 [4] and 1998 [2,3]. 



2 Currently Running Experiments 

We will restrict our discussion to earth bound experiments. There are also in- 
tentions to perform measurements of G in space [5]. However, these efforts seem 
to be in an early stage and are not yet fully funded. 

The present experiments are conveniently divided into two groups, those 
using a torsion balance and other experiments using a variety of techniques. In 
all experiments there are movable field masses producing an attractive force on 
relatively small test masses. Many techniques have been developed to sense the 
effect of these small forces. We will first discuss techniques employing a torsion 
balance. 

Cavendish operated his torsion balance in the static deflection mode. A 
dumbbell was suspended by a torsion wire and the angle of deflection was mea- 
sured, when two held masses were brought nearby. This method is not used 
anymore. Instead three other techniques are employed today. 

First, there is the time of swing method. The angular frequency of a torsion 
pendulum is measured with and without held masses. The torque caused by 
the gravitational attraction of the held masses increases the angular frequency 
when compared with the free pendulum. The difference of the squared angular 
frequencies is proportional to the second derivative of the gravitational potential 
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Table 1. Summary of the presently running experiments employing a torsion balance 
to determine the Newtonian gravitational constant. 



group 


method. 


status 


value 


rel. 




features 




for G 


uncer- 








[10-“m3 

kg-^s-^] 


tainty 

[ppm] 


M. P. Fitzgerald, 


force compensation, 


working on 


6.6746 [12]a 


150 


T. R. Armstrong [12], 


electrometer 


improvements 


6.6742 [12]b 


105 


New Zealand 


compensated 








J. H. Gundlach[13], 


acceleration, 


under 






Washington 


optimized mass 
arrangement 


construction 






O. V. Karagioz, 


time of swing 


published 


6.6729 


75 


V. P. Izmailov [27], 
Moscow 




results 






J. Luo 


time of swing 


published 


6.6699 


105 


et. al [29] 

People’s Rep. of China 




results 






G. G. Luther, 


time of swing 


old published [31] 


6.6740 


105 


C. H. Bagley 




new under constr. 






Los Alamos 




[32] 






R. D. Newman, 


time of swing. 


under 






M. K. Bantel [8], 


cryogenic, optimized construction 






Irvine CA 


mass arrangement 








S. J. Richman 


time of swing and 


first 


6.683 


1700 


et al. [11], 


force compensation 


results 






BIPM“ 


in one apparatus 









Bureau International des Poids et Mesures 



U of the field masses [6]: 



G = 




where 



n _ 1 

“ G ^ ■ 



The angle 9 is formed by the field masses and the axis of the pendulum and I 
denotes the moment of inertia of the dumbbell. This equation is valid under the 
assumption that the elastic coefficients of the torsion wire is independent of the 
angular frequency. However, in 1995 Kuroda [7] pointed out that inelasticity in 
the fiber could produce a systematic shift of the frequency. Newman and Bantel 
showed [8], assuming a certain model for the inelasticity, that an upper limit 
for the induced relative error of G is given by + 0(Q~^), where Q is the 

quality factor of the torsion pendulum. It is thus advantageous to have a Q- 
value as large as possible. As Q can be increased by lowering the temperature, 
Newmann and Bantel intend to operate their pendulum at the temperature of 
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liquid helium. As shown in Table 1 there are five groups employing the time of 
swing technique. 



Table 2. Summary of the present running experiments to determine G without using 
a torsion balance. 



group 


method, 

features 


status 


value 

[IQ-^m^kg-^s^] 


rel. uncer- 
tainty [ppm] 


U. Kleinevofi 


Fabry-Perot 


working on 


6.6735 


432 


et al. [30] 


resonator 


improvements 






Wuppertal 


(microwaves) 








A. De Marchi [17] 


dynamic 


first 






et al. 


free pendulum 


results 






Torino 










Wei-Tou Ni 


Fabry-Perot 


under 






et al. [33] 


resonator 


constrnction 






Taiwan 


(optical) 








F. Nolting 


beam balance 


working on 


6.6749 


217 


et al. [26] 




improvements 






Zurich 










H. Parks 


Fabry-Perot 


under 






et al. 


resonator 


construction 






Colorado 


(optical) 








J. P. Schwarz 


free fall 


published 


6.6873 


1400 


et al. [14,15] 


method 


results 






Colorado 











The second technique is called force compensation method. A electromag- 
netic force is applied to the torsion pendulum in order to prevent the torsion 
fiber from twisting. The electromagnetic quantity, for example the applied volt- 
age on an electrometer, is measured. Thus a calibration has to be done, i. e. the 
measured electromagnetic quantity must be converted into a mechanical force or 
torque. On the other hand, these experiments are static in principle and therefore 
should not be affected by inelasticity effects. Two groups have exclusively used 
this methods and both results showed large discrepancies [9,10] when compared 
to the CODATA value. Although unlikely, one may suspect that this could be 
an indication for some fundamental problem. For this reason the group at the 
Bureau International des Folds et Mesures (BIPM) [11] built a torsion balance 
where both the time of swing and the compensation method can be used. At 
present only preliminary measurements with the first method have been per- 
formed. The group is working on improvements and it is hoped that a accurate 
comparison of the two methods should soon be possible. The group in New 
Zealand has discovered a fault in the analysis of their 1995 measurement [10]. 
In 1999 they reported a corrected value [12] and a new and more precise result. 
Both are in agreement with most other results. Unfortunately the experiment of 
Michaelis et al. [9] has been discontinued some time ago. 
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The third important technique is the so-called acceleration method where the 
torsion balance is mounted on a turntable. This turntable is accelerated and con- 
trolled by a feedback loop in such a way that the torsion fiber does not twist. The 
angular acceleration of the turntable can be measured directly and the torque 
produced by the field masses be calculated. Furthermore, the measurement is 
static again and the experiments should not be affected by inelasticity. Obvi- 
ously, this technique requires high precision turntables and angular encoders. 
One such experiment is presently set up by Gundlach [13] at Washington. 

Altogether there are at present seven torsion balance experiments, which are 
listed in Table 1. Five groups have published a result. 

There are presently six experiments measuring G without a torsion balance. 
In 1998, J.P. Schwarz published a interesting free-fall experiment [14,15]. The 
test mass was a corner cube reflector falling vertically through the center of a 
doughnut shaped field mass. The acceleration of the test mass was determined 
by tracking its position using an optical interferometer. The force on the test 
mass was modulated by periodically moving the field mass in vertical direction. 
The field mass was made by tungsten because of its high density. The published 
result is given in Table 2. 

Three experiments use a Fabry-Perot resonator. The resonator is formed by 
two pendulums which act as test masses. The field masses are moved horizon- 
tally. Their gravitational force changes the relative distance of the two pendu- 
lums, proportional to the horizontal force gradient. The distance is measured 
by the eigenfrequency of the resonator. This technique was pioneered by the 
Wuppertal group [16]. They are using microwaves with typically 20 GHz, i.e. a 
rather large wavelength compared to the distance change of the two pendulums, 
which is about 10 nm. The measurements reported are very precise and the latest 
published result (see Table 2) is dominated by systematic uncertainties. There 
are two other experiments using this Fabry-Perot technique. They intend to use 
visible light, that is a much shorter wavelength. Both experiments are presently 
under construction. 

Another interesting experiment is performed by De Marchi et al. in Italy [17]. 
They exploit the fact that frequencies can easily be measured with very high 
precision. A simple pendulum, mounted in vacuum, is used and the frequency 
change is measured when a field mass is brought nearby. 

The experiment at the University of Zurich uses a beam balance and will be 
described in more detail below. 

All present experiments not using a torsion balance are listed in Table 2. 
A detailed review of the experiments to measure the Newtonian gravitational 
constant can be found in [18]. 

3 The Principle of Our Experiment 

The first experiment to measure the gravitational constant with the help of a 
beam balance was published in 1879 by J.H. Poynting [19]. Subsequently several 
similar measurements were performed but soon this technique was given up in 
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Fig. 2. The principle of the experiment. The two field masses are shown in the two 
positions as used for measurements. 



favor of the torsion balance. In 1983 the first modern beam balance experiment 
was done by C. Speake et al. [20] to search for the “fifth force”. The University of 
Zurich group used a beam balance in a experiment performed at the Gigerwald 
storage lake [21,22]. There, the gravitational constant was measured at an effec- 
tive distance of approximately 100 m. Since 1997 we are using a beam balance 
to measure G in a laboratory experiment. 

The principle of our experiment is shown in Fig. 2. There are two cylindrical 
field masses and two test masses. The field masses can be moved vertically and 
have a central hole such that the test masses can pass through. In position one 
the field masses are close together, while in position two they are apart. The two 
test masses remain at the same position but can be connected alternately to the 
balance. 

First the weight difference between the test masses in position one is mea- 
sured. Obviously, the weight of the upper test mass is increased by the grav- 
itational force of both field masses, while the weight of the lower test mass is 
decreased. The weight difference, defined as the difference in the readings of the 
balance, weighing first the upper test mass and then the lower test mass, is in- 
creased, when compared with weights of the test masses in absence of the field 
masses. 

In position two the situation is reversed. The weight of the upper test mass 
is decreased, while the weight of the lower test mass is increased. 
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Fig. 3. Schematic illustration of the force acting on a test mass on the axis of a field 
mass with the form of a hollow cylinder. 



The second difference, that is the difference of the weight differences for field 
mass positions I and II, is independent of small but unavoidable mass differences 
of the two test masses and directly proportional to G. All effects which are the 
same for the two test masses, such as tidal forces and zero-point drifts of the 
balance, cancel already in the first difference. 

The force field of a cylinder with a central hole has an advantageous feature 
which is exploited in this experiment. The z-component of the force F^, produced 
by one cylinder and acting on a small test mass on the symmetry axis, is shown 
in Fig. 3. There are two local extrema near both ends of the cylinder, which in 
fact are saddle points. This is shown in more detail in Fig. 4, where is plotted 
as a function of the radial distance and the z-coordinate. The plot was generated 
assuming the dimensions of one field mass as used in the experiment. Clearly 
the accurate position of a test mass is quite uncritical if placed near the saddle 
point. For an assumed position error of 1 mm, the force error would be of order 
10 ppm. 

4 The Experimental Setup 

The experiment is located in a 4.5 m deep pit inside an experimental build- 
ing at the Paul-Scherrer-Institut (PSI) . The necessary thermal and mechanical 
stability is provided by the thick concrete walls of the pit. The experiment is 
divided vertically in two parts separated by a working platform. As shown in 
Fig. 5, the field and test masses are located in the lower part. The upper part 
of the experiment houses the balance, the electronics and the vacuum pumps. 
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r [mm] 
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Fig. 4. The relative deviation of the ^-component of the force with respect to its value 
at the saddle point. The force is generated by one homogeneous cylindrical field mass 
(see text). 



Both parts are mechanically and thermally insulated from each other and each 
room has its own temperature stabilization system. 

The balance is placed on a massive granite plate, supported by two steel 
girders. They are attached to the wall of the pit and have no direct mechanical 
contact to the working platform. The balance and the two test masses are inside 
a vacuum system consisting of a chamber and a long tube passing through the 
central hole of the field masses. At present the pressure in the system is typically 
10^“^ Pa. In that way problems due to convection and buoyancy are avoided. 

The most important part of the experiment is the balance. It is a modified 
Mettler-Toledo AT 1006 comparator balance. The commercial version of this 
balance was designed for a high precision comparison of 1 kg standard masses. 
A schematic view is shown in Fig. 6. It is a single-pan, flexure-strip balance with 
a servo-controlled beam. A weight change on the cantilever is compensated by a 
magnetic system, consisting of a permanent magnet and a coil mounted on the 
beam. The current through the coil is measured and converted into a weight. A 
more detailed description may be found in [23]. The original resolution of the 
balance was 1 pg and has been changed to a present value of 12.5 ng. Currently 
we achieve a reproducibility of 200 ng which we define to be the standard devi- 
ation of 10 weight comparisons. The balance is calibrated by putting small and 
precisely known standard masses (presently two 0.5 g) on the cantilever. Finally 
the balance reading is converted into a force using the measured value of the 
local gravity, g = 9.8072335(6) ms“^. 
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Fig. 5. The experimental setup. Legend; l=chamber, 2=thermally insulated chamber, 
3=the balance inside of the vacuum chamber, 4=concrete walls of the pit, 5=granite 
plate, 6=steel girders which support the balance, 7=vacuum pumps, 8=gear assem- 
bly, 9=motor, 10=working platform, ll=spindle, 12=steel girder of the main support, 
13=upper test mass, 14=field masses, 15=lower test mass, 16=vacuum tube. 



The field masses are vessels made of stainless steel and can be filled with a 
liquid. A liquid is used, because of its high homogeneity. Each vessel has a volume 
of 5001 and a mass of 800 kg when unfilled. The outer diameter of the field mass 
is 1046 mm, the inner diameter 100 mm and the height 700 mm. Measurements 
with empty and with water filled vessels have been performed and first results 
were reported in [24,25]. 

At present the vessels are filled with mercury. The total filled in mass (13.5 x 
10^ kg) was determined with a relative uncertainty of 2 ppm. A detailed descrip- 
tion of the filling procedure may be found in [23] . 

The 1 kg test masses were made from copper and are gold plated. Small sam- 
ples were taken from the same piece of material as the test masses. The magnetic 
susceptibility of the samples was measured and found to be sufficiently small 
such that magnetic forces should be negligible. Also the density homogeneity 
was checked and at the level of 2 x 10“^ no differences were found. 
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Fig. 6. Schematic view of the balance. 



5 The Measuring Procedure and First Results 

The whole measuring procedure is fully automated and computer controlled. 
Initially the field masses are in position one, that is together. One test mass is 
suspended on the balance and weighed. This lasts typically 2 minutes. Then the 
test mass is lifted from the balance and at the same time the other test mass is 
connected to the balance. This is done, using a special exchange device, in such 
a way that the total load on the balance does not change by more than typically 
a few grams. The second test mass is also weighed and then the sequence above 
is repeated typically 20 times. Next a standard mass is put on the balance for 
calibration. Then the vessels are moved in position two, that is apart and the 
complete weighing procedure above is done again. All taken together this forms 
one cycle of the measurement (see Fig. 7), which is then repeated indefinitely. 
Typical data are shown in Fig. 8. Plotted is the first difference as a function of 
time. The amplitude of the rectangular signal is the second difference and has a 
approximate value of 0.8 mg for mercury filled vessels. The inset shows the data 
of the last half-cycle with the vertical scale expanded by a factor 500. 

Measurements with empty and water filled vessels were carried out during 
1997 and the results were published in [24,25]. From these data the gravitational 
constant was found to be 

G = (6.6754 ± 0.0005 ± 0.0013) x 10"“ m^kg-^s"^ (water). 

The statistical uncertainty is 75 ppm and the systematic uncertainty is 209 ppm. 
The detailed error budget is reproduced in Table 3. 

Since the beginning of 1998 measurements with mercury filled vessels are 
in progress. A preliminary result of these measurements was published in [26]. 
During the mercury runs a systematic variation of the measured signal as a 
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Fig. 7. A typical example of a measurement cycle. Plotted is the reading of the balance 
for the two field mass positions. Initially - in the left part of the graph - the vessels 
are apart and the weight of the lower test mass is increased, the weight of the upper 
test mass is decreased. At the right, corresponding to the first field mass position, the 
situation is reversed. The smooth variation with time is caused by the drift of the 
balance. 

function of time was observed. In February of 1998 the results were slightly 
larger than in later months until November. Surprisingly, a similar effect was 
also observed in 1999. For this year we have data until May. Up to now, no fully 
satisfactory explanation has been found. To take this into account, an extra 
uncertainty of 80 ppm, which is one half of the observed variation, was included 
into the error budget. As a consequence, the result from the mercury runs is not 
more accurate than the water results, despite the larger signal. The published 
value is 



G = (6.6749 ± 0.0014) X 10 ^^m^kg ^s ^ (mercury). 

Here only the systematic uncertainty is given and the statistical uncertainty is 
comparatively negligible. 

6 Outlook 

The results of the measurements with the water filled vessels and the mercury 
filled vessels agree within their errors (see Fig.l). Our results are 420 ppm or ap- 
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Fig. 8. Measured weight difference from three days of data during 1999. The inset 
shows the difference of the last half-cylce expanded vertically by a factor 500. 



proximately two of our standard deviations larger than the CODATA value from 
1986. The error budget is dominated by a possible, but not yet fully investigated, 
nonlinearity of the balance. This large uncertainty arose, because the balance 
was calibrated with a 1 g standard weight, whereas the measured signal was only 
0.8 mg (mercury). We hope that with future work a significant improvement is 
possible. The next largest error was assigned because of the observed systematic 
variation mentioned above. This effect is difficult to understand, because most 
conceivable causes should precisely cancel in the result. In fact, we have not 
yet been able to identify a unique cause but believe that it is related to some 
instabilities in the experiment. At present we are working on a greatly improved 
temperature stabilization. Also sensitive tilt meters will be installed to monitor 
possible tilt changes of the balance. We hope to reach the design accuracy of the 
experiment of 10 x 10“® in the near future. 



Acknowledgment 

We thank the Paul-Scherrer-Institut for the helpful support. We wish to thank 
the companies Mettler-Toledo and metrotec for providing and modifying the 
balance. We gratefully acknowledge the work of the machine shop of our institute. 
Furthermore, we want to thank all the metrological institutes that have helped 





Determination of the Gravitational Constant 



27 



Table 3. Systematic uncertainties of the gravitational constant, measured with water- 
filled and with mercury-filled vessels as published in [26]. The uncertainties are grouped 
according to the contributions of the test masses, the vessels, the liquid and further 
effects. They are estimated at a confidence level of 68.3% and the total uncertainty is 
the root-mean-square value 



Z\G/G[10-6] 

Source of uncertainty Water Mercury 



test mass position 


11 


10 


test mass dimension 


2.7 


2.0 


density inhomogeneity 


< 1 


< 2.2 


masses of test masses 


0.27 


0.27 


joints and spindle drive 


16 


2.2 


shape and volume 


14 


2 


dimensions of the vessels 


11 


1.6 


density inhomogeneity 


< 5 


< 0.06 


masses of the vessels 


4 


0.5 


air displaced by the vessels 


7 


2 


density of the liquid 


15 


18 


mass of the liquid 


8 


0.7 


nonlinearity of the balance 


< 200 


< 200 


systematic variation 


- 


80 


sorption effect 


45 


6.4 


integration 


< 13 


< 5 


tilt effect 


< 18 


< 4 


calibration 


5 


8 


local gravity 


0.06 


0.06 


total 


209 


217 
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Abstract. A pedagogical derivation is given of the Lense-Thirring effect using basic 
notions from the motion of point particles and light rays. First, the notion of rotation 
is introduced using the properties of light rays only. Second, two realizations for a non- 
rotating propagation of space-like directions are presented: the gyroscope and the spin 
of elementary particles. Then the gravitational field around a rotating body is specified 
which is taken for determining the various effects connected with a point particle or a 
gyroscope: the deSitter precession (geodesic precession) and the Lense-Thirring effect 
(‘frame dragging’). The results are applied to the precession of gyroscopes and to the 
motion of satellites around the earth. 



1 Introduction 

In the Lense-Thirring effect three rotations are involved: the rotation of the 
gravitating body, the rotation of the test body around its own axis, and the 
rotation of the axis of rotation of the test body. The properties of the gravitating 
body are prescribed, the rotating test body can be shown to move approximately 
on a geodesics around the gravitating body, and its axis of rotation can be shown 
to be Fermi propagated along the path of the test body resulting in a precession 
of the axis of rotation with respect to distant stars. In this note we want to derive 
all these notions and equations of motion from scratch in order to indicate clearly 
that everything is provided by General Relativity: Everything follows from the 
geodesic equation for point particles and the validity of Einstein’s equations; 
we don’t have to use additional assumptions. At the end we will discuss several 
experimental approaches to test the various effects related to rotating bodies. 

In the following we (i) introduce the notion of rotation, (ii) derive the equa- 
tion of motion of the spinning axis of a rotating test body, (iii) derive the gravita- 
tional field of a rotating gravitating body, and (iv) use these results for discussing 
and analyzing the equation of motion of the test body and of its spinning axis. 

All of what we assume is that gravity is described by means of a Riemannian 
geometry endowed with a space-time metric g and that light rays and freely 
falling point particles move along geodesics of that metric, 

DyV = av , (1) 

where D is the unique metric compatible, Dg = 0, torsion-free, DyV — DyU — 
[■u, u] = 0, covariant derivative. In components, {Dy,vY = rc^(9^u'^ + { 
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with the Christoffel symbol { } := \g''^ {df^gpo- + d^gpp ~ dpg^a)- We do not 

assume any normalization condition of the 4-velocity. Light rays with tangents I 
obey the same equation of motion (1), but with the additional condition g{l, 1) = 
0. - In addition, we will assume Einstein’s field equations. 



2 Rotation 



In order to define the notion of ro- 



tation, we use the so-called zig-zag 
construction, or the bouncing photon, 
as introduced by Pirani [1]. This con- 
struction uses a central point particle 
which moves along an arbitrary path, 
see Fig.l. In the neighborhood of that 
central particle there are two other 
point particles equipped with a mir- 
ror. At first, the central point particle 
emits a flash of light which hits the 
auxilary point particles. These auxi- 
lary point particles reflect this flash of 
light in such a way that it again meets 
the central point particle and, in ad- 
dition, the other auxilary particle po- 
sitioned appropriately behind the cen- 
tral particle. Then the auxilarty par- 
ticles again reflect the flash of light so 
that it again meets the central parti- 
cle and the other auxilary particle, and 
so on, see Fig. 1. For this construc- 
tion we assume that the two satellites 
are “near” to the central particle which 
means that no curvature effects should 
be involved. This is a condition which 
can always be fulfilled. 

It is not assumed that the central 
particle moves along a geodesic. And 
even if the path of this particle is geo 




auxilary particle/ auxilary particle 
central particle 

Fig. 1. Geometry of the bouncing photon. 
Two auxilary particles communicate via 
light rays in such a way that all light rays 
(dashed lines) have to meet the central 
particle. No particle is assumed to be in 
geodesic motion. 

esic, then the two auxilary particles in 



general are not geodesic because they always have to be re-positioned in order 



to meet the above construction. 



In this way the particles define a time-like 2-surface, or, after projection 
into the rest space of the central particle, a direction in the rest space prop- 
agating along the path of the central particle. This construction defines the 
propagation of the direction of the light rays in the rest space of the central 
particle. It turns out that this propagation can be used to define the notion of 
a “non-rotating” propagation of a vector. This notion coincides with the notion 




The Lense-Thirring Effect 



33 



of Fermi-displacement which usually is used for the description of non-rotating 
propagation. However, in our case we get this notion from an operational proce- 
dure. 

We now turn to the mathematical description of this procedure. We make 
use of the equation of motion (1). The condition that the light rays with tangent 
l\ and I 2 lie in the same plane with the central particle u is 

u = a\li + U 2 I 2 , for some ui, (T 2 G R . (2) 

The condition that, after reflection, the light rays will cross the central worldline 
again, is secured by 

C 1 J 2 = + ^ 2 h ■, for some ei , C 2 G M . (3) 

Now we derive the equation which governs the transport of the directions 

Vi := PJi or H 2 := PJ 2 , (4) 

along u, where 

P,.A-.= A-!P^u (5) 

g(u,u} 

is the projection operator onto the rest space of the world line of the central 
particle. 

Using CTiUi = — (J 2 V 2 we get 

Pv^PuDuVi = —PviPuDu{(TiVi) = ^PviPuDu{(TiVi - 0 - 2 ^ 2 ) . ( 6 ) 

( 7 1 Z ( 7 1 

Inserting (2, 3, 4) and the equation of motion (1) for the light rays li and I 2 , we 
finally get 

Pv,PuDuVi = 0. (7) 

The expression U„U := PyPuDuV is the so-called Fermi-derivative of the vector 
V along u. Eq.(7) is invariant against reparametrization of the paths, so that 
it indeed describes the propagation of the direction V . This is precisely the 
characterization of the bouncing photon which we take as definition for a non- 
rotating propagation of a vector along a given path [1]. If for another vector 
W defined along P the above expression does not vanish, then the operator 17, 
which is a ( } )-tensor defined by PwPuDuW = f^{PuW), is called the rotation 
of W . This is the characterization of the notion “rotation” we announced. 

Now we turn to the question whether, beside this bouncing photon, there are 
other physical realizations of a non-rotating propagation. 

3 Equation of Motion for Angular Momentum 

There are indeed two further realizations for a non-rotating propagation which 
are experimentally easier to handle with than the bouncing photon: One real- 
ization is given by a rotating gyroscope possessing orbital angular momentum, 
the other is the elementary particle with spin. 
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3.1 The Motion of Gyroscopes 

The metrical energy-momentum tensor which, within Einstein’s theory, is 
the source of the gravitational field, is symmetric and divergence-free 

= 0 . (8) 

This is the equation of motion for matter in General Relativity. 

In the case that matter is 
fairly localized (that is, ^ 0 
for a compact space-like region 
only, within which the gravita- 
tional field in terms of the cur- 
vature does not vary too much), 
it is possible to extract from this 
tensor a center-of-mass and 
correspondingly the equation of 
motion of this center-of-mass. 
We assume that this center-of- 
mass Xcm{T), where t is the 
proper time, lies within the 
body, that is, {xcm(T)|r G K} C 
suppT'^'^, see Fig. 2. With this 
center-of-mass worldline there 

is connected the 4-velocity v = 
d ^ 

dr-^cm- 

Fig. 2. The world-tube of space-time points In addition, one can define 
for which the energy-momentum tensor in non- an angular momentum with re- 
vanishing. The center-of-mass wordline is dotted, gpgct to this distinguished point 

is space-time and derive its 
temporal evolution. In doing so we follow the procedure originated by Papa- 
petrou [2,3] and developed further by Dixon and [4] and Ehlers and Rudolph 

[5]. 

We first introduce a 3-1- 1-slicing of the space-time by introducing hypersur- 
faces St with normal n^, see Fig. 2. From the energy momentum tensor we can 
define various moments = ^J—g 







:= f SxP^SxP^ ■■■5xP-‘ZP''dPx, ( 10 ) 

JSt 

where 5xP = x^ — x^^ with x,Xcm G St- We use coordinates so that t = const. 
over Sf. Consequently, Sx^ = 0 in (9) and = 5°. 

We can distinguish between various types of particles: If all momenta (9) but 
the symmetrical 




AfP'' = 



( 11 ) 
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vanish, then we have a particle which possesses no internal structure and thus 
is called a point particle. It will become clear in the following that in this case 

J St 

can be identified with the momentum of that point particle. — If all momenta 
but and 

(13) 

J St 

vanish, then we have a particle with mass and, in addition, an orbital angular 
momentum which in this case is given by 

LP'' := = 2 j . (14) 

Jst 

That this quantity indeed describes the angular momentum will become clear 
later, too. This quantity characterizes a special type of internal motion, so that 
we call this type of matter a spinning particle (please note, that this notion 
“spin” does not mean the elementary particle spin), or spinning top. It can be 
shown that the order of the highest moment is an invariant [2] . That means that a 
spinning particle cannot become a point particle by a coordinate transformation. 

We assume in the following that all other moments but (11) and (13) vanish. 
From these definitions and the basic equation of motion (8) we first derive the 
equations of motion for a point particle, and second for a spinning particle. 

The point particle. From (8) we have 0 = 8^%^^ + from 



which we get by integration 

^PP= [ doTP°d^x = -{P,}M’'^ . (15) 

at Jst 

In an analogous way we analyze the quantity f xP^P'^cfix. With (8) we get 

ao(x^T^°) + dt{xP‘lP^) = - xP{ P, (16) 

and thus, by integration, 

MPP=^f xP^P°d^x+ [ xP{P,}^'^^d^x. (17) 

dt Jst Jst 

We expand x and {Per} around the coordinate of the worldline Xcm 

X = Xcm + Sx, {Jfa}(x) = + Sx'^dtt{J}j}iXcm) (18) 

and get, taking into account the condition for a point particle, / SxTd^x = 0, 

MPP = vPpP. (19) 
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Since is symmetric, we must have = mv^ with m := Therefore 

= m v^v'' , and, consequently, we get from (15) D^{mv) = 0. From this 
and the fact that w is a normalized 4-velocity, g{v,v) = —1, we get Dy-m = 0 
which means that m is the mass of the particle which is constant. Then we also 
have the geodesic equation for the center-of-mass trajectory: 

Dyv = 0. (20) 

The spinning particle. This kind of particle is defined by f ^ 0 and 

f Sx^‘I‘'^d^x yf 0; all other moments vanish. We consider the divergences 9pT^^, 
dp{x^T‘"P), and dp{x^x"‘I'^P) and get in a way analogous to above 

= -{ p^, }MP^ - d,{ p^a , (21) 

, (22) 

, (23) 

where all Christoffel symbols are evaluated at the center-of-mass position Xcm 
and where, in addition to (11), we used (13). With definition (13) we also have 
(note 5x^ = 0) 

^ ^ (24) 

^ ^mOO (25) 

Now we first express, using (23), in terms of v and L, and second, using 

(22), the propagation of L, and, at last, with (21) the equation for the center- 
of-mass motion. 

Cyclic permutation of the three indices in (23) and adding two and subtract- 
ing the third relation gives 

2M^^'P = vP{M''P° + MP^°) + v'^LPP + vPpP '' , (26) 

where we used (24). By specifying cr = 0 in (23), we can express the first part 

also in terms of the angular momentum, + v'^ , so that 

we finally find 

2MP''P = vP{v''LP° + yPP''^) + v^PPP + yPP^'' . (27) 

Choosing ly = 0 in (22) and reinserting this into (22) gives 
0 = }MPP'^ - MP’' + yP (y''M°° + }Af'P'^ 

CLL \ (JjL 

(28) 

Antisymmetrization leads to an equation of motion for L: 

0 = J^LP'' - { P„ }M''P^ + { p-; }MPP'^ + yP }M''p^^ ~ (g ^ ip) ■ 

( 29 ) 
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Inserting in terms of the center-of-mass velocity and the angular mo- 

mentum, Eq.(27), we obtain a covariant equation of motion for the angular 
momentum L\ 

0 = . (30) 

Multiplication with Vi, gives DyL^^ = —v^DyL^'^ — v^v^DyL'^^ which can be 
inserted into (30) 

0 = DyL^^'' + v^^VpDyLP'' + v’'vpDyLPP = PyDyLP’' . (31) 

This is the equation of motion for the angular momentum. 

In a similar fashion [2] we derive from (21) the equation of motion for the 
path. We get 

Dy {mvP - VpDyLPP) = ^RP.p^v'^LP^ . (32) 

By counting the degrees of freedom, it is clear that (31) and (32) are 6 
equations for 3 components of and 6 components of . Therefore we have 
to reduce the numer of unknown components in the angular momentum. What 
is still unspecified in our approach is the center-of-mass coordinate. The center- 
of-mass coordinate can be determined by the so-called Frenkel condition 

0 = LP''v^ = v^ j 5x^P‘l''^°d^x , (33) 

which leads to an expression of the form 0 = f pSx^d^xF relativistic corrections, 
where p = is the energy density, see [5], e.g., for a detailed treatment of the 
center-of-mass problem. 

If the Frenkel condition is valid, then it makes sense to introduce a vector 
for the angular momentum := ^eP'^P'^v^Lpcr. In terms of this vector, Eq.(30) 
means that the angular momentum vector is Fermi propagated, 

FyL = PyDyL = 0 (34) 

and thus is non-rotating. 

3.2 Motion of an Elementary Particle with Spin | 

A spin-i-particle il) is assumed to obey an equation of motion which can be de- 
rived from the minimally coupled Lagrangian for the Dirac field in a Riemannian 
geometry: 

'LNj 

C = — {ipjPDpip - {Dpip)-fPi;) - mill'll) . (35) 

The parameter m is the mass of the Dirac particle. The matrices 7^ are given by 
7^ = where the 7“ are the special relativistic Dirac matrices obeying the 
Clifford algebra 7“7** + = 277“^ (7“^ = diag(— 1, -1-1, -1-1, -1-1)). The tetrads 

hP are defined by = rjab- Therefore 



7^7-^ -g Y-iP = 2gP'' . 



(36) 
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is the covariant spinorial derivative with the spinorial connec- 

tion := —\{D ^h'^)hbvG°'^ (the covariant derivative acts on the vectorial 
index /i in D^h'^ only). The := |(7“7*' — 7^7“) are the generators of the 
Lorentz-group. The adjoint spinor is defined by 'll! := (in this case ipilj 

transforms as a scalar). Here 7^°^ is the zeroth special relativistic Dirac-matrix. 
Variation of the above Lagrangian with respect to ip gives the field equation 

0 = — rmp . (37) 

This is the Dirac equation in curved space-time. Here we use c = 1. 

Now we describe a particle in a quasiclassical approximation. That means, 
we look for a solution of the Dirac equation (37) which locally has the form of a 
plane wave: 

^ = e*®(")a(x) . (38) 

Inserting this ansatz into the field equations (37) gives 

0 = + m)a + ihj^Dfj^a . (39) 

The main step of the quasiclassical approximation consists in the assumption 
that the external fields are weak enough, so that, in first approximation, the 
derivatives of the amplitudes can be neglected. That means + m)a + 

ihj^D^a « + m)a, or <C |ma|, where | • | denotes some norm 

on a complex vector space. If we use this condition, then we get from (39) with 

Pbi ■= -d^iS 

0 = - m)a . (40) 

Here is the momentum of the plane wave. Eq (40) is an algebraic condition 
which possesses a solution for p^^ if and only if the determinant of the coefficient 
matrix vanishes, 0 = det(y^p^ — m). This leads to the condition 

0 = ■ ( 41 ) 

(The exponent 2 characterizes the fact that we have for both spin states the 
same mass shell.) Eq (41) is a Hamilton-Jacobi partial differential equation for 
the phase S{x,t) which always possess a solution. 

From the plane wave ansatz (38) we can define a wave packet by superposition 
of plane waves from a continuous spectrum of momenta peaked around p^. Then 
one can show that the tangent vector of the path of the peak of this wave packet 
is given by the group velocity := ■^g^'^Pi,\p=p which fulfills the normalization 
condition, g(v,v) = —1. Differentiating (41) once more yields immediately the 
geodesic equation for this group velocity 

0 = D^v. (42) 

The integral curves of this geodesic equation are the paths of the peaks of wave 
packets. 
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If we have a solution of the first order equation, then the first part in (39) 
vanishes and we get as equation for the next order of approximation 

0 = . (43) 

What we are looking for is a propagation equation for the amplitude, that is, an 
equation of the form D^a = f{x,v)a which describes the evolution of a along 
the path given by v. For this we multiply (43) with + m)jm and get 

0 = 

m 

= — (7^ {Dp. {{PuY + m)a) - D^(YPu + m)a) + 7^]-D^a) 

m 

= -Dpv^a - 2v>^Dpa . (44) 

Here we used d[pP^] = 0. For obtaining this result, the existence of a Clifford 
algebra is important. With the definition for the expansion 9 := DpV^ we finally 
find 

DyQ = ~2^o, . (45) 

The same holds true for the adjoint spinor: DyO, = —^9 a. Within the frame of 
the theory of congruences [6] (see also the Appendix), 9 is interpreted as the 
divergence of the trajectories given by the phase S{x). If we define a normalized 
spinorial amplitude b := then we get [7,8] 

= 0 , = 0 . (46) 

That means that the normalized spinors h and h are parallely propagated along 
the path of the center of the wave packet. 

With these propagation equations for the spinors h and b, we can calculate 



propagation equations for the bilinears [9] S := bb, P := bi^^b, 
'■= %J^b, and := b2iG^''h. Using (40) one can derive 


jP 


II 


(47) 


S^Pp = 0 , 


(48) 


P = 0, 


(49) 


e^'^P^S'^Pp = -mSP’' . 


(50) 


The last relation can be inverted: 






(51) 


From these identities we get an interpretation of these bilinear quantities. 
Eq.(47) yields j“ = Sv‘^ so that S is the intensity of the Dirac field. Since 5'“ is 
an axial vector and, according to (48) a rest-frame quantity, it is identified with 
the spin of the Dirac particle. 
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Therefore, the only independent normalized quantities are the normalized 
current = b'y^b = and the normalized spin-vector = tr/^j^b. The 
propagation equations (46) for b and b then give propagation equations for j'' 
and S^: 



D^v = 0 , (52) 

D^S = 0 . (53) 

Therefore the direction of the spin is parallely propagated along the path of 
the Dirac particle. The spin behaves in the same way as a spinning top. (For a 
gravitational theory with torsion it can be shown that the spin couples to torsion 
while the orbital angular momentum does not [10].) 

We note without proof that in the next approximation it is possible to get 
an influence of the spin on the path of the wave packet [8] : 

Dyv = ^XcR*(-,v,S,v) , (54) 

where R* is the right-dual of the curvature tensor and c the Compton wave- 

length of the Dirac particle. 



4 Gravitational Field of a Rotating Body 



In this section we want to derive the general features of a gravitational field 
which is created by a rotating body. The gravitational field, that is, the space- 
time metric is given by Einstein’s equations 






T„, 



(55) 



where G = 6.673(10) x 10“^^ m^kg“^s“^ is Newton’s gravitational constant [11]. 
We now analyze two aspects of a rotation in the gravitational field: (i) we discuss 
the gravitational field of an arbitrary stationary situation and (ii) discuss the 
general structure of the gravitational field created from a rotating mass given in 
form of the energy momentum tensor. 



4.1 Stationary Gravitational Field 

A stationary gravitational field is characterized by a time-like Killing vector 
g{^,0 < 0, with £^g = 0, whereas a gravitational field with an axial symmetry 
is characterized by a space-like Killing vector rj, g{g, rf) > 0, with £^^g = 0 and 
the integral curves of g are space-like closed curves. An axisymmetric space- 
time possess both Killing vectors ^ and g which, in addition, should commute 
[C)^] = 0- For any Killing vector field then there exists a coordinate system, 
which coordinate lines are the integral curves of the Killing vector field, so that 
the metric does not depend on the coordinates corresponding to the Killing field. 
For a stationary gravitational field this means g{x) = g{x^,x^,x^) and, if the 
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gravitational field possesses an additional axial symmetry, then g{x) = g{x^ , x^), 
where x^ plays the role of the time coordinate and x^ the role of the angle (p. 
Therefore = gfj,^{x^ ,x^)dx^dx‘' . If we choose x^ = p and x^ = then 

ds^ = 5oo(P) ’d) dt^ + 2goi{p, id) dtdx'' + gij{p, d) dx* dx^ . (56) 

We call a gravitational field static if the rotation of the time-like Killing 
congruence vanishes (cf. Appendix): w = 0 or = 0. The gravitational 

field is stationary, if the Killing congruence rotates: w yf 0 or 0. 

The standard example for a static space-time is given by the Schwarzschild 
solution, and an example for a stationary space-time is given by the Kerr so- 
lution, see [12], e.g., or the space-time determined from a thin rotating disk 

[13]. 

4.2 Gravitational Field of a Rotating Source 

It is intuitively clear what a rotating source is: The source consists of a set 
of point particles (a gas, or a rigid body, for example), which form a rotating 
congruence. The particles of the source may interact with one another. Therefore, 
the source is a conguence of point particles moving on trajectories with 4-velocity 
u. This congruence may possess rotation, acceleration, expansion, and shear. If 
we have, as a very simple example, a perfect fluid, then we have as source of 
the gravitational field the energy-momentum tensor = {p + p)u^u’' +pg^'^, 
where p is the energy density and p the pressure. For p = 0 (dust) one can show 
that the geodesic equation (1) follows from (8). If the vector field u belongs to 
a rotating congruence, then this energy-momentum tensor describes a rotating 
source. 

We now calculate the gravitational field which is created by such a rotating 
source. For this purpose, we split the metric into two parts, g = go + gi- The 
curvature associated with g can be split into a term corresponding to go and 
terms depending on gi [12]: R{g) = R{go) + SR{go,gi)- We assume that the 
curvature associated with go vanishes, R{go) = 0. From the Einstein equations 
(55) we finally get a differential equation for the part gi := gi — ^gigo with 

- UI/ 

9 ■= 9o 9iiiu- 

□ ffi = KT, (57) 

where □ is the d’Alambertian with respect to the metric go and where we have 
chosen a coordinate system such that = 0, with g^'' := g'o'^ go^ gipa- Be- 

cause the curvature associated with go vanishes, it is possible to introcduce a 
global coordinate system such that gopu = If the source is stationary, then gi 

does not depend on the time and (57) reduces to the Poisson equation Agi = kT 
which can be integrated, 

provided T falls off appropriately at spatial infinity. 
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The component Tqq =: p of the energy-momentum tensor is interpreted as 
the energy density, the components T^o = Tio =: pvi as the energy flux, and the 
Tij as the stress tensor (i, j, . . . = 1, 2, 3). Therefore, 



(51)00 ^ ^ J 

{gi)io ^ ^ J 
{9i)ij ^ ^ J 



.3 / 

p{t,r')v,{t,r') . , 






It is clear that (5i)io is smaller than (51)00 by a factor v/c and (51 )y by a factor 
(u/c)^, cf. [12]. In the case of an isolated body and large distances, we have 

M C (r y. 

[/:= (51)00 = G - , fe^:=(5l).o = -^ ^ (62) 

r 2 

where M is the total mass and L the angular momentum of the gravitating 
body. 

In a coordinate system where the components of the metric are isotropic, we 
have as line element 

= —(1 — 2U + 2[/^) dt^ + (1 + 2U){dx^ + dy^ + dz^) — Ahi dx^ dt , (63) 



gfj.i' = 



-1 + 2U-U^ -2h^ 

-2h, (1 + 2U)5, 



This metric is time-independent. Thus a time-like Killing vector ^ = dt exists, 
in components = Sq. The different components of this vector ^ are given by 

= <^o 1 Co = 5oi/C^ = 5oo , C* = 5 *mC^ = 5o* = —‘2hi . (65) 

Hence the curl of the Killing vector held (see Appendix) is connected with d^ihjp 
The tetrads (one-forms, a = 0, ... 3) connected with the Killing vector 
held ^ are given by 



f = -l + G-^C/2, 


= -h^ 


(66) 


= -ha , 


df = (1 + U)Sf . 


(67) 



Later we need to boost this tetrad to a comoving (with the gyroscope) tetrad 
■5“ _ where x is the relative velocity of the gyroscope with respect 

to the tetrad d“. Since for a pure boost with small velocity v we have L = 



1 - 2 ± 



• X 

X Sj + ^x’‘Xj 



, we And for the new tetrad 



= -l + U--U^--x\ d'f> = -h, + Xi 



df = (1 + U) Sf + 
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5 Lense-Thirring Effect 

Let us combine the results and notions derived above in order to describe the 
dynamics of a rotating test body in the neighborhood of a gravitating rotating 
body. We consider a stationary situation: The gravitating body rotates with a 
constant angular velocity. In the field of such a body we will consider the motion 
of a point particle as well as the motion of a gyroscope. 

5.1 Motion of a Point Particle 

The equation of motion for a point particle is the geodesic equation DyV = 0. 
The four components of this equation can be evaluated by using the solution 
for a static spherically symmetric mass distribution, that is the Schwarzschild 
solution, or the solution for rotating bodies, like the Kerr metric or the metric 
for a rotating disk of dust [13]. However, in our approach, we restrict ourselves to 
the case of a weak stationary gravitational field. In this case, for a point particle 
without spin and in 3-notation, the geodesic equation reads 

d^v 

— = VI/ + F-2U X (V X W. (70) 

dt^ 

The first term is the Newtonian part whereas F symbolizes nonlinear contribu- 
tions of the gravitostatic field U which are responsible for the perihelion shift, 
for example. The last term is the gravitomagnetic part due to the rotation of the 
gravitating body. Since the motion of a satellite around the earth represents a 
gyroscope, too, this interaction results in a precession of the angular momentum 
of the satellite around the earth. Therefore, the plane of the path of the satellite 
is no longer fixed, as it is in the Schwarzschild case, but starts to process instead. 
Thus the pericenter (perihelion) or the nodes (intersections of the paths of the 
satellite with the equatorial plane of the earth) move. This should be observable 
in an experiment proposed by Ciufolini [14] according to which two excentric 
satellites orbit around the earth. 

5.2 Motion of a Gyroscope 

For the description of the motion of a direction attached to a gyroscope we use 
Eqs.(34) and (32) or (53) and (54). For simplicity we assume that the path is 
geodesic which is very well fulfilled because, according to Eqs.(32) and (54), all 
non-geodesic terms can be neglected for weak gravitational fields. 

The non-rotating frame defined by the gyroscope will be compared with a 
direction given by a fixed star far away from the gravitating body. The light from 
that star comes from a fixed direction. Thus the tangent vector I of light rays 
of this star is stationary: £^l = 0. Accordingly, we can introduce a stationary 
space-like unit vector: C(i) = Pul/{g{Pul, Pyl))^^^, which again is stationary 
= 0. We can complete this unit vector to give a 3-bein by adding two 
more spatial unit vectors which are orthogonal to C(i) and to each other and 
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which are stationary, too: g{ea,e~f^ = £^ea = 0. If we take e(o) = u, then 
Ca = {u, Co) defines a tetrad with £^Ca = 0. Accordingly, also for the dual basis 
we have £^-d = 0. 

In order to determine the behaviour of the gyroscope with respect to the 
direction given by the fixed star, we project the spin of the gyroscope on the 
comoving basis connected with the fixed star: := This projection is 

the quantity observed. We calculate the dynamics of this projection S°‘ = dyS°‘, 
where S is parallely displaced along the path of the gyroscope, while the basis 
is Lie-displaced along 

^ Lorentz-transf. L“{, ^ 

comoving with gyro attached to distant stars 




D^S = 0 = 0 

The four-velocity of the gyroscope is related to the fourth leg u via u = yrt + 
')x°'ea where 7 is the Lorentz factor (1 — and is the relative velocity 

measured between u and v. Moreover, because of = 0, we have 0 = 

L(o)fcT9^(S') = L(o)(o)t9(o)(S') + L(o)at9“(S') so that 

= -iar(5). (71) 

We calculate, using (34) and (53), respectively, (a = 1,2,3), 

5 “ = i?„(r(5)) 

= {Dj-){S)+d^{D,S) 

= {Dj-){S) 

= D^L\§\S) + L\{D,-9'>){S) . (72) 

/ 1 _1_ i 

The first term can be evaluated by using L“a = ( ga 

\ S 2 

D.L\ = (*|f -aj.)) • With (71) this yields 

D^L\§\S) = + D,L--^d\s) « -\{vS ~ . (73) 

For the seond term 

L\{D^^^){S) = « L\{Du^’^){S) = L\e~^ {D^d^){S){74) 

we use the fact that the frame d is stationary: = 0. In components: 0 = 

so that Thus 

-KD^Sr = -^“(e'^u;/ + ir - • 







(75) 
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Therefore we find for the total precession of the spin 
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(76) 



with = 0 and where we neglected terms with “velocity x gravitomagnetic 

field” and terms of the order x^. Using x“ = a“ + a“, where a is any non- 
gravitational acceleration, we finally arrive in 3-notation at 




OxS, 



(77) 



with 



S7 = V X 



1 3 ' 



V xh. 



(78) 



The first term t) x a is a special relativistic term, called the Thomas precession 
which is known from atomic physics. It describes the precession of the spin due 
to inertial forces. Thus, the second term, v x Vf7, is a gravity-induced Thomas 
precession, the so-called de Sitter precession or geodetic precession. Note that 
only the Newtonian potential enters this term. The last term is purely post- 
Newtonian and describes the Lense-Thirring effect. This is the rotation of the 
locally non-rotating frame with respect to distant fixed stars due to the rotation 
of a nearby rotating gravitating body (‘frame dragging’). 



6 On the Observation of Gravitomagnetic Effects 

The systematic analysis of relativistic effects of planetary motion and motion 
of the moons of the planets dates back to the first years after the publication 
of Einstein’s theory in 1915. In 1916, W. de Sitter [15] predicted a geodetic 
precession of the rotating earth-moon system (‘earth-moon gyroscope’) in the 
gravitational field of the sun. (The de Sitter term in (78) provides a simple 
model of the dynamics of that system.) The effect has finally been detected in 
the late 80’s by means of an elaborate combination of lunar ranging and radio 
interferometry data [16]; refined data can be found in [17]. The accuracy of 
this verification is of the order of 1%. Three years after de Sitter, J. Lense and 
H. Thirring published their pioneering work “about the influence of the proper 
rotation of the central bodies on the motion of the planets and moons according 
to Einstein’s gravitational theory” [18]. Analyzing the equations of motion (70), 
they excluded measurable effects for the moon orbit as well as for the orbits 
of the planets, but found considerable secular relativistic perturbations of the 
orbital parameters of the moons of the outer planets. In particular, Jupiter V 
evidenced a gravitomagnetic shift of its pericenter of 2.26 arcsec/yr. However, a 
confirmation of their prediction by observation was not possible at that time. 
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In the next two sections, we will discuss the theoretical fundamentals of recent 
satellite experiments. In this context, the earth will be considered to be a sphere 
(radius R) rotating with a constant angular velocity about its axis, which has a 
fixed orientation in an inertial system connected with the (distant) stars. 



6.1 Lense Thirring Effect for Point Particles 

According to (70) and (62), the equation of motion for a spinless particle (satel- 
lite) in the gravitational field of the rotating earth is given by^ 



d'^r 1 



2G dr 
dt 



L-'^r 



(79) 



where M is the mass of the earth and L is its angular momentum. A detailed 
discussion of these equations and explicit expressions for the relativistic pertur- 
bations of the particle orbit may be found in the original paper of Lense and 
Thirring [18]. 

According to the proposal of Ciufolini, it should be possible, with present- 
day technology, to measure the advance of the pericenter and the nodes of highly 
eccentric satellites. A first attempt to do this using the LAGEOS I and LAGEOS 
II satellites, has been carried through in [19-21]. It was possible to verify the 
gravitomagnetic effect with an precision of about 10%. This poor precision is a 
result of the low eccentricity of the orbits of the satellites and the difficulties in 
eliminating the multipoles of the earth which give rise to contributions of com- 
parable order. The idea of a further experiment [14] is to orbit a new LAGEOS 
satellite with the same orbital parameters as those of an existing LAGEOS, but 
with supplementary inclinations, and to observe the bisector of the angle be- 
tween the nodal lines which defines a kind of gyroscope. The expected precision 
of the verification of the gravitomagnetic effect is of the order of 3% after 3 years 
of Laser measurements (see also [22]). 



6.2 Lense Thirring Effect for Gyroscopes 

Since the early 60’s, the Stanford orbiting gyroscope experiment. Gravity Probe 
B, has been under development [23]. The experimental construction has been 
completed and the experiment should be performed in 2000/2001. The idea is 
to put a spacecraft in a polar orbit equipped with four gyroscopes (see Fig.l on 
page 53 of Everitts’s paper in this volume) and to measure the gravitomagnetic 
precession of the spins of the gyroscopes. To calculate the expected numerical 
values for the de Sitter and Lense-Thirring effects, we consider a single gyroscope 
with spin S at a circular polar orbit. We introduce a co-moving but non-rotating 
orthogonal coordinate system E the z-axis of which is parallel to the earth’s 
angular momentum L and the x-axis of which lies in the orbital plane (y = 0), 
which has a fixed position with respect to the distant stars. The orientation of 
the co-moving frame can be maintained by two telescopes on board the satellite 
each of which points at a particular fixed star. 

^ For simplicity, the nonlinear term of (70) has been omitted. 
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Fig. 3. The Lense-Thirring effect for gyroscopes: The earth rotating with angular 
velocity a; and angular momentum L creates a gravitomagnetic field with the shape of 
a magnetic dipole. A gyroscope with angular momentum or spin S moves around the 
earth along a geodesic circular polar orbit (thick solid line). The Lense-Thirring effect 
consists in the precession of S around the direction given by the field lines of V x /i. 



In order to apply Eqs. (77,78), we have to specify the position vector r of 
the gyroscope (origin of E) and the angular velocity f2. Obviously, 

r = {r cos uiot,0,r sin ujQt) . (80) 

Here r is the constant distance of the gyroscope from the center of the earth and 



1 GM 



(jjQ — 

r V r 

is the orbital angular velocity of the satellite. Then, 

^ „ 3 IGM GM GL ^ 

^ “ 3 cos 2u;ot)^ 

A good approximation for the angular momentum of the earth is 

L = 0.3306 • MR^oj, 



(81) 



(82) 



(83) 



where M = (5.974 ± 0.004) 10^^ g is the mass, R = (6378140 ± 5) m the equa- 
torial radius, and oj the angular velocity of the earth. 
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Note that the choice of S has separated the de Sitter and the Lense-Thirring 
contributions to fi: fix and fiz are pure Lense-Thirring terms whereas fiy is of 
geodetic origin. After a decomposition of the spin vector S in spherical polar 
c o or dm Sit G s 

Sx = S cos (fi Slu'd , Sy = S sin ip sin , Sz = S cos'd, S' = [S'] , (84) 

Eqs. (77,78) take the form 

ip = — fix cos p cot -d — fiy cot "d sinp + fiz , (85) 

•d = — fix sin p + fiy cos p . (86) 

In order to keep the two effects separate, we may start from an equatorial position 
(t = 0;r = (r, 0,0)) and choose the spin vector S to be perpendicular to the 
angular momentum L of the earth (t = 0 : p = 0,'d = ^). From the linearized 
Eqs.(85,86) 

z = ifixZ + {fiy + ifiz) , (87) 

where z = “d — ^ + ip, we finally obtain the desired secular angular precessions 



A'd = fiyAt = 



3 Igm GM , 

o\ 

2 \ r 



A _ t . LjIj . 

Ap = Hz At=—At. 



(88) 

(89) 



Here fi* is the time-averaged fiz as experienced by the gyroscope. For a polar 
orbit at about 650 km altitude (r = (6371 + 650) km) this leads (note also (83)) 
to a rate of 

= 6.6 arcsec/yr (90) 

for the geodetic precession and 



Ap 



0.041 arcsec/yr 



(91) 



for the Lense-Thirring precession. 

The goal for the precision of the Gravity Probe B experiment is about 0.01% 
for the de Sitter effect and about 1% for the Lense-Thirring effect (in contrast 
to the Lense-Thirring effect for orbiting point particles the multipole moments 
of the earth play no role here [24]). The measurement of these effects is based 
on SQLTDs. The numerical values (90) are illustrated in Fig.l of Everitt’s talk 
(see p. 53), which, moreover, describes the technicalities o f the sophisticated 
equipment. 



6.3 Lense Thirring Effect in Quantum Physics 

Due to huge improvements in the accuracy of devices based on the quantum 
properties of matter, it may be useful to estimate the effect of a rotating grav- 
itating body on quantum particles. Two types of effects can be imagined: The 
effect on a matter wave interferometer and the effect on the spectrum of atoms. 
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In the first case, the field acts like a rotation of the interferometer if 
the interferometer is attached to the fixed stars. The effect of rotation of the 
interferometer on the phase of the quantum interference is the famous Sagnac 
effect i5^sagnac = "'^here a; is the angular velocity of the interferometer 

and A its area. Due to great success, e.g., in the cooling of atoms which makes 
it possible to prepare interfering atoms which stay for a long time inside the 
interferometer and thus possess a long interaction time, it may be possible to 
detect the Lense-Thirring effect with atomic interferometry, see [25] for a recent 
account on the sensitivity of atomic interferometers on rotation. 

A realization of this effect is attempted within the HYPER project which is 
planned to put atomic interferometers in space and to measure, beside the fine 
structure constant and the quantum gravity induced foam structure of space, the 
Lense-Thirring effect. For this purpose, two atomic interferometers based on Mg 
and two based on Cs will be placed in two orthogonal planes. The resolution of 
rotation rates aimed at is rad/s for an integration time of 1000 s. Note that, 
contrary to the GP-B approach where the cumulative effect over apoproximately 
one year is read out, in this case the angular velocity 'V x h is measured locally. 
No integration over many days is carried through. The integration takes place 
for a few minutes only, that is, for a duration, over which the curl V x is 
approximately constant. HYPER is planned to be put into orbit within the next 
10 years. 

It has been shown that the rotation of the earth has an influence on the 
spectrum of atoms: While searching for anomalous spin-couplings in atoms [26], 
one has to compensate for the influence of the earth’s rotation on the spin. This 
in fact establishes [27] an experimental verification of the coupling between spin 
and rotation, see [28] . However, since the accuracy of this result is not very high, 
and since the rotation caused by f2 is about 9 orders of magnitude smaller than 
the earth’s rotation, there is no hope in near future to use this approach for a 
verification of the Lense-Thirring effect. 

Appendix: Theory of Congruences 

For a time-like vector field u, g{u,u) = —1, which may be interpreted as a field 
of four velocities being tangents at a set of point-like particles, like dust, we 
have [6] 

= UJi/fi “t“ U^Qi/ , (92) 

where 

:= i.Pu)ui.Pu)1tDy„Up\ , (93) 

9 := P^DinUu) , (94) 

:= {PX{Pu);D^.u,) - ^0(P„)^. , (95) 

a := D^u , (96) 
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with the projection operator 



{PuY,-.= 5^, + u''u,. (97) 

For a time-like Killing congruence, that is, a congruence the four-velocity of 
which is proportional to a Killing vector field, 

C=e^M, £^g = Q, g{u,u) = -I , (98) 

we have = 0 and 0 = 0. A Killing congruence possesses only rotation and 
acceleration. (One can show, that w is, indeed, a rotation as defined in section 
2.) Then the acceleration of u is given by 

a := DuU = dU , DuU = 0 , (99) 

which we get from projecting 0 = onto u'' and u^u'' . With this 

result we get 

= D^{e^Uy) = e^ujy^ + . ( 100 ) 
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Abstract. NASA’s Gravity Probe B Mission is a test of two predictions of Einstein’s 
General Theory of Relativity based on observations on very precise cryogenic gyro- 
scopes in a satellite in a 650 km polar orbit about the Earth. Construction and the 
first round of testing of the flight payload was completed in December 1999. Of the 32 
planned qualification tests 28 were passed with complete success, meeting or in several 
instances surpassing the program requirements. However, one test very unexpectedly 
revealed a p roblem in the thermal performance of the Dewar/Probe system which has 
required a significant redesigin and rework, now successfully completed. Gravity Probe 
B is scheduled for launch on April 1, 2002. This article reviews from the physicist’s 
viewpoint the experience of living through a space flight program. 



1 Gravity Probe B: An Experiment in Physics — 
and Management 

To physicists trained in ground-based research the thought of carrying out an 
experiment in space is a daunting prospect. The laboratory experiments we are 
accustomed to seldom work at a first attempt: many stages of redesign and 
reconstruction are necessary before the desired performance can be met. Space 
is different. Space experiments of their very nature have to be designed and built 
so that once on orbit they will work first time. Furthermore the design has to be 
‘robust’. Robustness means introducing redundancy into areas that might fail. 
Redundancy, however, adds complexity. Striking the right balance between these 
competing requirements is critical to any successful space mission. By now, after 
forty years of space flight, considerable experience exists about how to approach 
such issues (not least how to bring together the right mix of science, engineering 
and management skills to do so). 

The NASA/Stanford Gravity Probe B Mission (GP-B) is a Fundamental 
Physics experiment designed to provide two extremely precise tests of Einstein’s 
theory of gravitation, General Relativity, based on observations of electrically 
suspended gyroscopes in a satellite in a 640 km circular polar orbit around 
the Earth, see Fig.l. It will measure the geodetic effect due to the curvature of 
space-time by the Earth to approximately 2 parts in 10® and the frame-dragging 
effect, with its subtle connections to gravitomagnetism and Mach’s principle, to 
an accuracy approaching 0.3%. 



C. Lammerzahl, C.W.F. Francis, and F.W. Hehl (Eds.): LNP 562, pp. 52-82, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 




Gravity Probe B 



53 




Fig. 1. The Two Relativity Effects Predicted by Schiff. 



A crucial feature of GP-B is that it is ‘a controlled physics experiment’. It is 
so in two senses. In the first place the two relativistic effects dominate the data: 
error terms such as the Newtonian drifts of gyroscopes are reduced to negligible 
values. Second, no less important, designed into the experiment is a rigorous 
program of on-orbit verification and calibration in which mission parameters 
are varied in a controlled way to check and, if necessary, remove specific sources 
of err or. 

After many years of technology development, GP-B is now nearing comple- 
tion. Final assembly of the payload with the spacecraft takes place in March 
2001. Launch is scheduled for May 1, 2002. 

It was not until late in 1959, 44 years after Einstein published his theory, that 
any conceptually new test of General Relativity was proposed. It was then that 
two men, Leonard Schiff and George Pugh, in complete independence of each 
other, were able to define one, or rather two, fundamentally new tests based 
on observations of non-Newtonian precessions of gyroscopes in Earth orbit. Re- 
markably, almost at the same time a number of other tests of the theory were 
conceived including the Shapiro time delay ex periment and redshift tests lead- 
ing to the Gravity Probe A sub-orbital clock experiment. Then came various 
new or extended theoretical approaches, best known of which is the param- 
eterized post-Newtonian (PPN) formalism due mainly to Kenneth Nordtvedt 
and Glifford Will. PPN provides a framework for comparing metric theories of 
gravitation; it has also suggested interesting new null experiments. 

It is no accident that Schiff and Pugh came to their ideas just two years 
after Sputnik. Both recognized that an orbiting gyroscope would experience the 
geodetic and frame-dragging effects. They also saw that the performance of 
a gyroscope in space is potentially much better than on Earth because of the 
reduction in support force. Schiff ’s principal contribution was to provide the first 
elegant and correct derivation of the two effects. Pugh in a striking, too little 
known paper (November 1959) gave an imp ressively complete error analysis of 
a possible experiment. In particular he suggested for the first time the brilliant 
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and far-reaching concept of a drag-free or drag-compensated satellite. Gravity 
Probe B has evolved out of these early thoughts, together with discussions Schiff 
had at Stanford in 1959-60 with William Fairbank of the Physics Department 
and Robert Cannon of the Department of Aeronautics and Astronautics. From 
there GP-B became a cooperative effort between the two departments, a most 
for tunate occurrence. 

Indeed, without a collaboration of this kind it is hard to imagine even at- 
tempting a program involving so many diverse technologies. In this test of Ein- 
stein, physics and engineering are inextricably linked. Important also has been 
the presence on the Stanford campus of an infrastructure capable of making the 
collaboration effective, provided by the interdisciplinary Hansen Experimental 
Physics Laboratory (HEPL) with its long history of developing medium-scale, 
long-term scientific programs. HEPL has prov ed to be an ideal administra- 
tive vehicle for a mission of this complexity. In addition, a rarity on university 
campuses, it offers an extended building with flexible high bay space and crane 
coverage. To have such a building within easy walking distance of the two depart- 
ments so that physics and engineering students, academic staff and faculty can 
come together in a common enterprise is an extraordinary benefit. This happy 
combination of circumstances, not planned in advance, deserves reflection. Oth- 
ers int ending to perform fundamental space experiments may And it wise to 
think through not only the experiment itself but the structures and expertise 
that will be needed to make it happen. 

No less important is to think through, in conjunction with NASA or the 
relevant space agency, the relationships between academia and aerospace indus- 
try. On GP-B with invaluable support from NASA Marshall Genter, excellent 
working relations were established with Ball Aerospace in the early study phase 
of the program and subsequently in flight contracts with Lockheed Martin. In 
establishing the university-industry relation it is essential to formulate a clear 
plan of what should be done in academia and what in industry, and to devise a 
suitable contractual vehicle for managing the industrial effort. In facing, as one 
must, technical surprises, it is well to be prearmed as much as possible against 
the contractual surprises that too often accompany them. 

2 Shape of the Experiment 

The GP-B Science Instrument Assembly (SI A) is illustrated schematically in 
Fig. 2. It comprises four gyroscopes with their spin axes aligned parallel to the 
line of sight to the guide star and mounted in line in a quartz block structure, 
to which is attached a reference telescope, all operating at a temperature of 1.8 
K. The gyroscopes are fused quartz spheres 38 mm in diameter, coated with a 
1.25 ^m film of superconducting niobium, suspended electrically within fused 
quartz housings by voltages applied to three mutually perpendicular saucer- 
shaped electrodes, as illustrated in Fig. 3. To support the rotor on Earth the 
voltage required over the 32 /rm gap is 650 V. In space it is reduced to 0.1 V. 
The gyroscopes are spun up on orbit to about 100 Hz by means of helium gas at 
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Fig. 2. The GP-B Science Instrument Assembly. 



a temperature of 6.5 K running at sonic velocity through a differentially pumped 
channel in one half of the housing. 

The gyro readout is based on a special property of superconductors known 
as the London moment. A spinning superconductor develops a magnetic field 
aligned with its instantaneous spin axis, and this serves as a magnetic ‘pointer’ 
even in a perfectly round, perfectly uniform sphere. The direction of the London 
moment, and therefore of the spin, is measured magnetically by a SQUID mag- 
netometer connected to a four-turn superconducting loop sputtered on to the 
parting plane of the housing. Because the spacecraft rolls around the line of sight 
to the star the signal is chopped at roll rate, reducing limits on measurement 
from 1// noise. The roll period is between one and three minutes (17 to 5.5 
mHz). At 5.5 mHz the noise performance is about 7 x 10“^® J/Hz, equivalent to 
resolving 1 mas^ in an integration time of 7 hours, meeting the GP-B mission 
goal of 1 mas in 10 hours. 

More generally, rolling the spacecraft proves to be a very important overall 
symmetrizing principles, averaging out other potential sources of error in Gravity 
Probe B: drifts in the telescope readout, and many of the Newtonian drift torques 
on the gyroscope. 

The SIA is enclosed in a cylindrical cryogenic vacuum ‘probe’, 0.3 m in di- 
ameter and 2.4 m long, inserted into the inner well of a dewar vessel containing 
2400 1 of superfiuid helium (Fig. 4). The dewar, which maintains cryogenic tem- 
peratures on orbit for 18 months, serves as the main structural element of the 
spacecraft. Boil-off gas from the dewar, vented through proportional thrusters, 
provides thrust authority for attitude, translational, and roll control. Also shown 
in Fig. 4 is a sunshade that prevents scattered sunlight entering the telescope. 



1 



1 mas = 1 milli-arc-second. 
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Fig. 3. The GP-B Flight Gyroscope: Note the difference between the two halves of the 
gyro housing. Each has three circular support electrodes, but the one on the right has 
the differentially pumped spin-up channel, while the one on the left has the supercon- 
ducting loop sputtered on to its parting-plane. 



An essential feature of the experiment is the provision of both dc and ac 
magnetic shielding. For the London moment readout to work, the field trapped 
in the gyro rotor during cooling through its superconducting transition temper- 
ature must be below the extremely low value of 3 x 10“^° T - well below the 
limit achievable in practice by conventional ferromagnetic shields. Equally im- 
portant is to eliminate ac disturbances from external magnetic fields such as the 
Earth’s: the required ac shielding fa ctor is 10^^. These requirements are met 
by surrounding the instrument with a nested series of conventional and super- 
conducting magnetic shields, and placing rigorous constraints on the magnetic 
properties of the probe and SIA. To produce the required ultralow magnetic 
field use is made of a technique developed originally by B. Cabrera, based on 
expanding a succession of superconducting lead shields in the inner well of the 
dewar. A conventional ‘cryoperm’ shield produces an initial ambient field in th 
e well of about 10“® T. Because the quantity conserved in superconductors is 
magnetic fiux (field times area), if a folded lead bag is cooled in this low field 
and then expanded, the new field level will be substantially lower. The process is 
then repeated with a second lead bag to obtain an even lower field. In practice, 
with three expansions, field levels as low as 10“^^ T are regularly obtained. 

In their earliest proposals for an orbiting gyroscope experiment Pugh, and 
independently Schiff and his colleagues, recognized that the performance of a 
gyroscope would be greatly improved by operating in the low g environment of 
space. The reason is simple. By far the most precise kind of gyroscope, then and 
now, is an electrically supported spinning sphere. The simplest of the torques 
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Fig. 4. The Science Mission Dewar and Probe-C: Cutaway view, with the Science 
Instrument Assembly in position with its separate vacuum ‘probe’. 



acting on such a suspended sphere is the ‘mass-unbalance’ torque which occurs 
when an acceleration acts on a sphere that is perfectly round but not quite 
perfectly homogeneous. The drift-rate 17 caused by an acceleration / at right 
angles to the spin axis is 



2vs r 



( 1 ) 



where r is the radius and Vs the peripheral velocity of the spinning sphere, 
and Sr is the distance between center of mass and center of support. For a 
sphere homogeneous to 1 part in 10® with a peripheral velocity of 20 m/s the 
resulting mean transverse acceleration required to reduce the drift-rate to 0.03 
mas/yr is 4 x 10“^^ g. This is an instructive number. It demonstrates the insight 
of George Pugh - and also of B.O. Lange, who independently advanced the 
same idea in August 1961 - in recognizing the need for drag compensation. The 
acceleration levels on satellites at the GP-B altitude with normal area/mass 
ratios are typically of the order 10“® g. In Gravity Probe B as a drag-free 
satellite rolling about the common axis of the gyroscopes the mean cross-track 
acceleration will be below 10“^^ g. 

Arguments of the same kind developed in detail by G.M. Keiser and A. 
Silbergleit, and discussed further in Section 4 below, establish that torques due 
to the action of the suspension voltages on the out-of-roundness of the gyroscope 
are reduced to similarly acceptable levels. In all, three different classes of errors 
have to be considered in Gravity Probe B: gyro drift errors, gyro readout errors, 
uncertainty in the proper motion of the guide star. Fig. 5 gives current estimates 
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Fig. 5. Expected GP-B Experiment Accuracy. 



of each of these terms and the result for an overall experiment with one, two, 
three, and four gyroscopes. 

The spacecraft carrying electronics, telemetry, solar panels, mass trim mech- 
anisms, a sunshield for the telescope, and other support equipment, fits around 
the dewar. Fig. 6 is a general view of GP-B in orbit. 

3 Incremental Prototyping 

From 1964 to 1984 the main effort on Gravity Probe B went to advanced tech- 
nology development, along with Mission Definition and Phase A studies and 
an in-house Phase B study performed at NASA Marshall Genter. In US fiscal 
year 1985 NASA decided to proceed with an instrument development program, 
the Shuttle Test of the Relativity Experiment (STORE). As first conceived, the 
object of STORE was to fabricate the Science Payload, and then launch it on 
Shuttle for a seven-day rehearsal experiment that would tes t all systems in 
space and operate the gyros at Shuttle acceleration levels. The Payload would 
then be returned to Earth, mounted into the spacecraft, and launched once more 
by Shuttle from the SL-6 Western Test Range at Vandenberg Air Force Base in 
Galifornia for release into a free-flying polar orbit. 

STORE provided the essential mechanism for developing the Gravity Probe 
B Payload. However, with the Ghallenger accident in 1986 and subsequent clo- 
sure of the Western Test Range the planning had to be rethought. The new 
constraints on Shuttle payloads and the fact that it was no longer possible to 
launch Shuttle into a polar orbit were serious complications. A modified program 
was therefore devised with an increased emphasis on incremental prototyping. 
Although the main design principles of the Scienc e Instrument Assembly were 
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Fig. 6. Gravity Probe B Spacecraft. 



well established by 1985, the engineering challenge was formidable. Equally chal- 
lenging was the design of the cryogenic vacuum ‘probe’ containing the SIA which 
is complex and has many features unique to GP-B^. Thus the neck tube of the 
probe must contain cooled radiation windows to intercept infrared radiation that 
would otherwise be a severe heat load on the SIA and dewar. On the other hand, 
it also must serve as a high speed pumping line to exhaust gas at low pressure 
from differential pumping of the gyro spin system. It was by no means obvious 
in advance how these two requirements could be reconciled. 

Nor could the sophistication of the GP-B dewar be underestimated. Though 
there was important engineering and flight heritage for superfluid helium dewars 
from the IRAS and GOBE programs, some of GP-B’s most critical features 
were unprecedented in space. A major issue was the ultralow magnetic Held 
requirement, which demands having a permanently cold superconducting lead 
bag in the inner well of the dewar and therefore inserting the warm probe into 
the cold dewar. This is an intricate process in which it is necessary to mount 
a temporary airlock on top of the dewar to prevent solid air from condensing 
into the inner well. Once in place, the probe has to be locked down thermally 
and mechanically at appropriate poins in the dewar. Incremental prototyping 
has helped immensely in arriving at a sound design and establishing correct 
insertion procedures. 

Incremental prototyping attacks the hardest engineering problems first by 
using full-size flight prototypical designs with flight interfaces, but with reduced 

^ Following standard cryogenic engineering practice, reference is made in this paper to 
three successive cryogenic ‘probes’ to be inserted into the dewars, Probe-A, Probe-B, 
Probe-C; this terminology is not to be confused with the use of the word ‘probe’ in 
Gravity Prob e B. 
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functionality and without costly flight build standards. Specifically, three full- 
sized payload system tests have been conducted with the three ‘probes’ of in- 
creasing fidelity just referred to (A, B, C), where Probe-A is a pre-prototype; 
Probe-B (designated as the backup flight probe) is essentially of flight -quality; 
Probe-C is the actual flight probe. These have been evaluated in the following 
successive ground-based tests: 

A. First Integrated Systems Test (FIST) 1990 1991. FIST consisted of 
a demonstration test of Probe-A in a semi-prototypical Engineering Devel- 
opment Dewar (EDD) with science-mission-like interfaces. Probe-A incorpo- 
rated a prototypical quartz block but no telescope and only two gyroscopes. 
It operates in a magnetic held of about I0“® T obtained with conventional 
shields. The use of only two gyroscopes made additional electrical and other 
connections available for instrumentation to investigat e the thermal and 
vacuum performance of the probe. 

B. Ground Test Unit GTU-0 1991—1994. GTU-0 was also performed with 

Probe-A and the EDD but with an ultra-low magnetic held shield T) 

installed in the dewar. Cold insertion of the probe into the dewar by means 
of the airlock resulted in low trapped flux in the gyroscopes. 

C. Ground Test Unit GTU-1 1994—1995. GTU-1 was the first use of 
Probe-B. It included: (a) operation with all four gyroscopes and a mass 
model of the telescope; (b) achieving a < 0.1 nT trapped dipole in the gyro 
rotor; (c) test of the fully coupled dc SQUID readout; (d) electromagnetic 
interference (EMI) tests; (e) operation at ultra high vacuum with a sintered 
titanium cryopump. Also performed at room temperature was a successful 
protoqual shake test of the probe with the te lescope mass model and caged 
gyroscopes, fully confirming the reliability of the design. 

D. Ground Test Unit GTU-2 1997. GTU-2 was performed again with 
Probe-B but mounted in the Science Mission Dewar (SMD). This provided a 
verification of a complete payload integration and test procedures using the 
SMD together with a full functional test of all subsystems (SIA, Probe and 
Dewar) before and after an integrated system shake test at protoqual levels. 

E. First Complete Payload Test August December 1999. Results of 
this test are discussed in Section 5. 

4 Risk Mitigation and Verification Matrix 

Developing a flight mission compels the physicist to address two interrelated 
sets of problems very different from traditional laboratory experience. An ap- 
paratus has to be made that will survive launch and function in the unfamiliar 
environment of Earth orbit or outer space. It also has to work even though not 
all aspects of its performance can be verified experimentally on the ground. Ad- 
dressing these two issues requires, in the first place, systematic thought about 
principles of risk mitigation and the c oncept of a verification matrix. Secondly 
it requires a well-ordered test plan to assure that the risks and uncertainties 
have indeed been properly accounted for. 




Gravity Probe B 



61 



From an engineering standpoint ‘risk mitigation’ means first that the exper- 
iment has to be designed conservatively to withstand the vibration of launch 
and the temperature conditions and charged particle environment encountered 
on orbit. In each case it is necessary to think not only of the design but of the 
testing necessary to verify that the completed payload meets the engineering and 
scientific requirements. Some of the following observations should be regarded 
as general space practice; others are spec ialized to Gravity Probe B. 

A. Surviving Launch: The Tradeoff Between Strength and Weight. 

The recipe for surviving the hostile launch environment might seem simple - 
extreme over-design so that the apparatus cannot possibly break. The flaw 
in this simple nostrum is that there is another constraint, weight. It seems 
almost a law of human nature that during the development of any spacecraft 
its weight will grow beyond the capability of the launch vehicle. Always 
at some point a strenuous program of weight reduction becomes necessary. 
In Gravity Probe B the largest contributing factor was the hea vy outer 
shell of the dewar, especially the two end-domes. After investigating several 
options, including the use of special low-density alloys we chose the well 
established but sophisticated method in which the inner surface is relieved 
with suitably-patterned rib structures (an ‘isogrid’ design). The final effect 
was a reduction of dewar weight of about 18 percent. Another frequently 
encountered weight penalty, surprising to the laboratory physicist, is from 
the electrical harnesses. Long experience ha s shown that these are always 
underestimated. In Gravity Probe B we were fortunate to begin harness 
layout early and were thus subject to less surprise than some programs have 
been. 

More generally, the physicist engaged in a space program is impressed to 
observe that through all the development a constant weekly watch has to be 
kept on weight and power budgets. 

B. Temperatures of Electronics Boxes: Two Issues and Their Reso- 
lution. The conditions under which electronics boxes have to function in 
space are very different from those typical of ground-based laboratories. In 
Gravity Probe B the surface temperature of the spacecraft is around 220 K 
(— 50°G). Also as it rolls around the line of sight to the star the electronic 
boxes will heat and cool in the presence of the Sun. This is a critical point in 
the design because it limits the accuracy of our earlier statement that rolling 
the spacecraft eliminates readout errors and gyro suspension torques. Obvi- 
ously, if the temperatures of key components vary at roll-rate the statement 
is no longer quite true. In building the electronics, therefore, thermal risks 
of two distinct kinds must be mitigated. They must have the engineering 
robustness to work under extreme conditions. They must also have the sci- 
entific refinement to maintain temperatures within a very narrow and known 
range over each roll cycle. Both constraints must be met in a situation where 
the electronics components are functioning in hard vacuum: any heat has to 
be removed by radiation or solid conduction, there is no atmospheric con- 
vection to aid us. These conditions require a ground verification program 
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of very considerable sophistication and have a critical influence on space- 
craft design. Key electronics boxes are enclosed in a Forward Electronics 
Enclosure (FEE) at the front end of the spacecraft with passive temperature 
control; within the individual boxes active temperature control is applied to 
certain particularly sensitive elements. As part of the on-orbit verification, 
temperature data is telemetered to ground. 

C. The Charged Particle Environment. The presence of the Van Allen 
belts at or a little above the 640 km Gravity Probe B orbit altitude places 
requirements on the GP-B electronics different from anything encountered 
on Earth. Particularly critical are the effects, well known to space engineers 
but less familiar to most physicists, of the South Atlantic Anomaly (SAA). 
The old discovery by Gauss in the 1840s that the Earth’s magnetic held 
can be represented by a tilted dipole displaced toward North modifies the 
lives of space scientists. Over the South Atlantic there is a region where 
for a period of 5 to 15 minutes the spacecraft e ncounters a much higher 
density of charged particles than elsewhere in its orbit. Since the Earth is 
rotating the SAA does not affect GP-B in every orbit; nevertheless there 
are several each day in which the charged particle environment is hostile. 
An obvious question is what effect this may have on the gyro and telescope 
readouts. Tests of the SQUID magnetometers in a special facility at the 
Paul Scherrer Institute in Villingen, Switzerland designed to provide in a few 
hours a particle dosage equivalent to a year’s passage through the SAA were 
pleasantly reassuring. The SQUIDS continued to function with only a small 
and acceptable number of measurable ‘flux-jumps’. On the other hand, with 
the telescope detectors some periods of interruption will certainly take place. 
Throughout the electronics ‘rad-hardened’ components must be used in all 
critical areas, and the design must be made ‘robust’ so that any temporary 
electronics failure will not have catastrophic effects. Most critical of all is the 
digital gyr o suspension system. To guard against upsets of the computers 
controlling the suspension it is designed with two parallel control computers, 
a backup analog suspension system, and an arbiter to monitor performance 
and determine which system shall be used. 

Such are the complexities of gyro suspension. In addition, it is necessary to 
prevent excessive charging of the rotor itself. For this purpose the charge 
on the rotor is measured by injecting a signal into the suspension system 
and the information so obtained is used to control an ultraviolet discharge 
mechanism based on the photoelectric effect. 

D. Electromagnetic Interference (EMI) and Compatibility (EMC). 
The issue of electromagnetic interference is familiar to physicists engaged in 
ground-based experiments. Nevertheless space in general, and Gravity Probe 
B in particular, have special problems. The compactness of the spacecraft 
means that unusual care is needed to insure that signals from the telemetry 
of the science data to Earth do not interfere with the payload electronics. 
There is also the bizarre-seeming fact that in low-Earth orbit signals from 
television stations on the ground ca n be a formidable problem. In Gravity 
Probe B a special issue is that the gyroscope readout based on the London 
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moment depends on accurately measuring very minute changes in magnetic 
fields and therefore extremes of magnetic shielding. In addition, each gy- 
roscope must be shielded from the next and the readout of each must be 
shielding from its own gyro suspension signals. Without superconducting 
shielding and frequency separation between the suspension and readout sig- 
nals, these requirements could probably never have been met. The fact that 
they were, as demonstrated in the tests of Probe-C discussed below, is one of 
the most reassuring results of the GP-B development. It is a success critically 
dependent on incremental prototyping. 

Mitigating risk is a total process requiring judgments of many diverse kinds. 
Risks need to be identified but they need also to be weighed intellectually and 
sometimes even literally. If a particular mitigation adds to the weight of the 
spacecraft, it may in that form be impracticable. Then there is the complicated 
question of complexity. Does the mitigation achieved by introducing a redun- 
dancy so increase the complexity that in the end it increases rather than reduces 
risk? All such issues have to be addressed in a context of cost, recognizing that 
in space programs cost penalties are of two kinds. There is the immediate cost 
of designing, fabricating, and testing some previously unplanned part or system 
but there is also the secondary, often far greater, problem of the ‘marching army’. 
Any schedule delay increases cost far more than one at first imagines because it 
entails keeping a large spacecraft team together longer than originally planned. 

Mitigation involves assessment and planning; it also involves verification. Re- 
flection on that fact leads to the important concept of a ‘verification matrix’. In 
any actual spacecraft there are many systems. Some have been flown many times. 
Others are new but very similar to systems that have been flown before. Others 
again are new but capable of being tested with greater or lesser completeness in 
the laboratory. Finally, others may not be capable of being tested at all before 
launch and can only be verifie d through calculation. The idea of a verification 
matrix is to lay out in a orderly way all the systems, their requirements and the 
nature of the verifications to be performed: inspection, similarity, analysis, test, 
and once that is done to establish, and where necessary enhance, the realism of 
the verification process. 

As an example consider Newtonian disturbances of the Gravity Probe B 
gyroscopes. Almost by definition gyro performance cannot be fully verified until 
GP-B is on orbit: if it could, one would be seeking to perform the experiment 
in a ground-based laboratory. That, however, does not preclude a rationally 
conceived and experimentally well-founded verification process. During the long 
history of Gravity Probe B an exhaustive error analysis has been conducted 
identifying no fewer than 120 different known class ical gyro torques, most of 
them exceedingly small. The verification questions are whether all these have 
been calculated correctly and whether any significant torques might have been 
left out. 

A broader view reveals that the torques can be classified in a number of ways 
which allow plausible comparisons of their relative values and aid in setting rea- 
sonable limits on unknown terms. One distinction is between support-dependent 
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torques (which decrease in space) and support-independent ones. Another is 
between torques due to pressures on the surface of the gyro rotor and those 
arising from momentum transfer through the volume of the material. A third 
is between torques varying linearly with time a nd those like Brownian motion, 
photon bombardment, and the impact of cosmic rays on the rotor which tend to 
cause ‘random-walk’ effects evolving as the square root of time. Simple as these 
distinctions are, they throw a flood of light on the expected performance of the 
experiment. 

Thus, the two obvious classes of torque involving surface pressure are those 
from gyro suspension and from residual gas in the housing. The ‘electrical pres- 
sure’ has to be such as to apply to the rotor accelerations of order 10“® g. At the 
planned operating gas pressure torr) the corresponding acceleration un- 

der the extreme assumption that all the pressure acts on one side of the rotor is 
g Hence, it is inherently plausible - and true - that gas-pressure torques 
are neg ligible and all our attention should be focused on electrical pressures, 
i.e. on the suspension torques. 

In fact, the suspension torques dominate the gyro performance. The analysis 
by Keiser and Silbergleit demonstrates that these electrical suspension torques 
are governed by 15 distinct coefficients. A laboratory verification, therefore, 
should evaluate each one and establish their consistency with other measured 
parameters, e.g. out-of-roundness of the gyro rotor, rotor centering, spin-axis 
alignment, etc. Measurements on a flight type gyroscope suspended against Ig, 
performed at Stanford by Dr. Yoshimi Ohshima, have successfully determined 
all 15 coefficients. 

With the coefficients known, classical electrical formulae based on Coulomb’s 
law determine the resultant gyro drifts to be expected on orbit. This is a powerful 
verification. Fig. 7, based on the analysis of G.M. Keiser and Silbergleit, gives the 
top eight non-relativistic terms contributing to the precession of the gyroscope 
in the plane of the frame-dragging effect. 

Of these no fewer than seven are support dependent torques and of the seven, 
four originate in imperfect roll-averaging due to heating and cooling of the elec- 
tronics as the spacecraft rolls in the presence of the Sun. We are thus led to 
a further verification. By determining in the laboratory the temperature coeffi- 
cients of key electronics systems, one can set criteria on their allowed tempera- 
ture variations in space. This, in turn, leads to an on-orbit verification through 
measuring and transmitting to E arth a record of the temperatures. It is even 
possible to use this record as a mode of on-orbit calibration allowing errors to 
be backed out by calculation after the fact. 

Although the argument from pressure eliminates the most obvious gas torque 
it does not guarantee that the residual gas has no effect. Molecules continually 
moving back and forth between the rotor and housing cause the gyroscope to 
slow down. If there is any asymmetry in this process it will result in a gyro 
drift due to ‘differential damping’. At 10“^^ torr the characteristic spin-down 
time of the gyroscope is 320,000 years. For GP-B the rolling of the spacecraft 
around the line of sight to the star h as a powerful symmetrizing effect and the 
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A. Support— dependent torques from temperature variations of suspen- 
sion electronics at spacecraft roll— rate (1,5, 7, 8) 

1. Roll-variation of preload voltage difference {ha — hh) 

5. Fixed sums of preloads {ha + hb, he) 

7. Roll-variation of sensing bridge voltages 

8. Roll-variation of preload sums {ha + hb, he) plus fixed miscentering of rotor 
Note: all above are conservative upper limits, awaiting calibration of flight Elec- 
tronics. Also, temperature variations of electronics will be measured On 
orbit. 

B. Support— dependent torques from non— temperature— dependent ef- 
fects (3,4,6) 

3. Gyro spin axis misaligned with spacecraft roll axis 

4. Gonstant transverse acceleration on gyro parallel to Earth’s gravity gradi- 
ent. Acting through misalignment between orbit plane and line of sight to 
star. 

6. Sagging of gyroscope at twice orbital period due to Earth’s gravity gradient 

C. Support— independent torque (2) 

2. London moment coupling to local magnetic shield-gyro axis misaligned with 
line of sight to star 

Fig. 7. The Top Eight Gyro Drift Torques. 



computed value of this differential damping torque is below 0.0001 mas/year. 
While this in itself is beyond ground verification, there is an important ground 
verification related to it, namely, to measure the characteristic spin down time 
of the gyroscope under laboratory conditions. The measurements confirm the 
accuracy (at low temperature) of the gas drag formula. They also provide a 
crucial check on whether any debris are present in the housing. 
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5 Probe-C Assembly March— August 1999 

Assembling the SIA and the Flight Probe has been a lengthy process, lasting 
more than 6 months. Individual systems and subsystems had to be fully qualified 
before assembly, all under the discipline of record keeping and Quality Assurance 
unfamiliar to most physicists. In total Gravity Probe B had 32 distinct mechani- 
cal and electrical systems that required verification through the Probe-C tests. It 
is only through dealing with an apparatus of this sophistication that the physicist 
comes to realize the benefi ts of the rigorous QA procedures developed through 
hard-won experience in the aerospace business. 

Among the systems tests that were performed before the assembly of the SIA 
and Probe-C began some of the more important were as follows: 

A. Gyro Qualification. To date, more than 110,000 hours of gyro testing 
have been conducted in a variety of cryogenic and room temperature test 
facilities. The final qualification of the individual gyroscopes to be inserted 
in Probe-C was carried out in a special Gyro Acceptance Facility designed 
to allow the verification that each gyroscope meets five flight requirements: 
(1) asymptotic spin speed > 100 Hz; (2) spin-down rate after evacuation 

< —2 mHz/hr (verifying gyro cleanliness); (3) low trapped magnetic field 

< 3 X 10“^° T; (4) discharge rate by ultraviolet system > 10 fA/^W (in 
combination with the charge measurement system, this will allow the mean 
rotor voltage to be kept < ±10 mV); (5) SQUID-dc coupling < 1% decay 
of 200 flux quanta after 15 min (verifying integrity of the readout loop). 

B. Qualification of SQUID Assemblies for Gyro Readout. Each SQUID 
assembly consists of a SQUID microchip sensor mounted on a sapphire circuit 
board with a superconducting thin-film input transformer, electronic com- 
ponents, and connectors, filter components for rejection of electromagnetic 
interference, all contained within a superconducting box made of “reactor 
grade” niobium and sealed with an indium alloy gasket. The SQUID chip was 
initially selected for low noise by cryogenic testing. It was then mounted into 
the SQUID assembly, which was subjected to cryogenic qualification testing 
to determine sensor noise, linearity, transfer function, temperature sensitiv- 
ity, bias current level, stability, the superconducting persistence of the input 
circuit, and magnetic and EMI attenuation. Only assemblies that meet all 
qualification criteria are accepted for use in Probe-C, and only SQUID sen- 
sors that have remained stable through multiple thermal cycles to cryogenic 
temperatures are incorporated in these assemblies. 

C. Magnetic Qualification of Probe Components. Control of the mag- 
netic properties of materials and assemblies used in Probe-C is essential for 
the London moment readout. To meet the necessary stringent requirements, 
a Magnetic Control Plan was set up dividing the Probe into magnetic zones 
based on distance from the gyroscope rotors. Within each zone, limits were 
placed on the residual magnetic moment and magnetic susceptibility of each 
component, and on the maximum level of stray ac magnetic field. Samples 
of the actual material stocks, from which probe components had been fabri- 
cated, were characterized magnetically at temperatures down to 2 K using a 
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Fig. 8. GP-B Telescope: The cylindrical structure in the center contains the image 
dividers. The mirror on top of it aids in telescope testings. The two DPA’s, each with 
redundant detectors, are mounted in the two structures set at 90 degrees of each other 
on the edge of the telescope. 



SQUID magnetometer. Once fabricated, the assemblies were further tested 
for magnetic cleanliness with a SQUID gradiometer to verify that machining 
and assembly operations did not introduce unacceptable magnetic contami- 
nation. 

D. Telescope Qualification. Fig. 8 illustrates the front end of the completed 
Flight Telescope, fabricated entirely of fused quartz and held together by 
a silicate bonding technique invented by Dr. Jason Gwo. It is a folded 
Cassegrainian system, of 3.75 m focal length and 0.15 m aperture, and a 
physical length of 0.32 m. Two star images are formed, one for each readout 
axis; the images are divided at two orthogonal roof prisms and the inten- 
sities of the divided beams are compared in redundant D etector Package 
Assemblies (DPA), operating at a temperature of about 70 K, mounted on 
the front end of the Telescope. 

The Flight Telescope was the third in a series whose evolution is another 
illustration of incremental prototyping. In fact, incremental prototyping ap- 
pears equally in testing the telescope testing. No fewer than three artificial 
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Fig. 9. SIA in Advanced Stage of Assembly: Note the suspension cables and two of 
the four SQUID boxes mounted on the upper side of the ‘bird-cage’. 



stars have been constructed to evaluate its performance. The one used for 
qualification was the second, on which a vibration-isolated dewar vessel was 
mounted to allow testing both at room temperature and low temperatures. 
Tests of the DPAs were performed separately to determine no ise perfor- 
mance prior to their installation on the telescope. Linearity measurements 
with the artificial star demonstrated that the telescope would remain linear 
to < 3 mas over a range of ±20 mas, meeting the requirements of spacecraft 
pointing control. 

With the individual systems for the Probe and SIA all qualified, it became 
possible to proceed to final assembly conducted in a class 10 clean room at Stan- 
ford. The Quartz Block/Telescope structure was inserted into an inner frame- 
work (the ‘bird-cage’) in the Probe, followed by the exacting process of insert- 
ing and aligning the gyroscopes into the Quartz Block with their centers within 
50 /im of a common line. (Hence, when the spacecraft has been mass-balanced 
on orbit, the gyroscopes will be w ithin that 50 /rm radial distance from the roll 
axis, minimizing the centrifugal acceleration to which they are subjected). Fig. 9 
illustrates the SIA in an advanced stage of assembly showing the suspension 
cables, SQUID assemblies, gyro caging line, and fiber optic cable for uv charge 
control. The final tests performed before inserting the Probe in the dewar were 
a room temperature spin test for all four gyroscopes, measuring the spin-down 
rate after evacuation and thus verifying once more that the gyr os were free from 
particle contamination, and a telescope field-of-view test. 
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Fig. 10. Probe-C in Airlock Ready for Placement on Science Mission Dewar. 



6 Probe-C Testing in Science Mission Dewar 
Angust— December 1999 

Because the ultra low magnetic field shield is maintained permanently in the 
inner well of the SMD, the process of inserting the Probe is, as remarked earlier, 
an unusual one. Whereas in almost all cryogenic systems the probe is inserted 
before the dewar has cooled down, here a method had to be developed for in- 
serting a warm probe into a cold dewar. In doing so, it is essential to avoid 
condensing any solid air into the dewar well. The method (devised by B. Cabr- 
era) is to make use of a cylindrical air lock within which the probe is fitted with 
a tight piston seal. By mounting the probe and air lock on top of the dewar 
it is then possible to pump out any air and backfill the air lock with helium 
gas prior to insertion. Insertion has to be performed sufficiently slowly not to 
generate excessive heat and warm up the superconducting lead shield. To aid 
in this process, the dewar well is filled temporarily with liquid helium prior to 
insertion. Fig. 10 shows Probe-C mounted in the airlock ready for placement on 
the dewar in its vertical position. 

Probe-C was inserted into the Science Mission Dewar on August 24, 1999 
and testing was continued until December 15. Fig. 11 shows the assembled probe 
and dewar under test at Stanford in a horizontal position. 

At an early stage it became clear that there was a significant, quite unex- 
pected anomaly in the thermal performance of Probe-C, which would require 
a recycling to room temperature. However, rather than immediately warming 
up we decided to ‘retire risk’ by completing the entire range of planned qualifi- 
cation tests on every one of the 32 major subassemblies contained in Probe C 
and the SMD. 28 out of 32 met or surpassed the stated requirements; 3 more 



70 



C.W.F. Everitt et al. 




Fig. 11. Probe-C under test in the SMD in a Horizontal Position: The dewar can be 
tilted through a range of angles to perform different tests. 



discrepancies (1 significant, 2 minor) were uncovered in a ddition to the thermal 
anomaly named above. These discrepancies were of a character that would not 
have prevented launching a successful mission but the necessity of warming up 
has given the opportunity to address them all. 

A. SIA. Tests of the SIA covered 7 major subassemblies and 1 systems test as 
follows: 

• Telescope: testing with artificial star ^ 3 (designed to mount on the 
SMD) verified performance under final operating conditions. The optical 
transmissivity and the strehl ratio (which measures the quality of the 
final image) surpassed the requirements. All 8 detectors in the two DPAs 
functioned correctly. The noise performance for the selected guide star 
HR 8703 meets the 10 mas/-\/Hz pointing requirement. 

• SQUIDs: all 4 SQUIDs function correctly, with noise performance meet- 
ing, and in 2 of the 4 surpassing, the requirement by a factor of 4. In 
separate tests of electromagnetic compatibility (EMC) with the Gyro 
Suspension System the shielding was shown to fully met stated require- 
ments. 

• Gyro levitation and spin-down: all 4 gyroscopes met the levitation re- 
quirements and the stated spin-down requirements, showing that at low 
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temperature as well as in the earlier room-temperature tests the gyros 
were free from particle contamination^. 

• Gyro caging: tests of the caging mechanism met requirements both at 
room temperature and low temperature. 

• Gyro uv charge control system: all 8 systems (2 per gyro for redundancy) 
functioned correctly yielding a photocurrent > 100fA//iW, surpassing 
the requirement by more than a factor of 3. 

• Gyro readout coupling: for 3 of the 4 gyros (## 1, 2, 3) the coupling of 
the SQUIDs to the readout loop met the superconducting requirement; 
for # 4 it showed resistance > 1,000 17 which is unacceptable. 

• SIA sensors and wiring: all of the thermometric and other sensors on 
the SIA functioned correctly, and all of the wiring met specifications; 
there was one electrical short on the gyro ^ 4 suspension shields, which, 
however, had no practical impact. 

• Trapped flux in gyro rotors: As noted in C below, the ambient field in 
the lead bag was < 2 x 10“^° T. Trapped fields in 3 of the 4 gyroscopes 
were below the 3 x 10“^*^ T requirement but in gyro ^ 1 (the one nearest 
the top of the bag) field levels were anomalously high (10.6 x 10“^° T). 

B. Probe-C. The 3 principal system entities: (1) plumbing; (2) sensors and 
wiring; (3) radiation windows, met their requirements, with the exception of 
minor leaks in the caging lines and one valve. The practical impact of these 
minor leaks would have been negligible but since the probe has been warmed 
up they are being repaired. Transmission through the windows exceeded 80%. 
The vacuum integrity of the probe fully met its requirement. The failure in 
thermal performance is discussed below. 

C. Dewar. The 4 principal system entities: (1) plumbing; (2) sensors and 
wiring; (3) low-temperature valves; (4) ultralow magnetic field shield met all 
their requirements. The magnetic field level in the lead shield was < 2 x 10“^° 
T. The vacuum integrity and thermal performance also met all requirements. 

D. Interfaces. Interface requirements had to be, and were, met for 3 systems: 
(I) SIA-to-Probe; (2) Probe-to-Dewar; (3) the Belleville-washer preload 
system to maintain thermal contact between Probe-C and the SMD. 

The foregoing tests were highly successful in retiring risk. At the conclusion 
of this phase of payload testing 90% of the required systems verifications had 
been completed. 

7 The Four Discrepancies 

and Their Resolution August 1999— June 2000 

Considerable work was done to characterize the four discrepancies while Probe-C 
was still in the SMD. After its removal on December 15, 1999 new diagnostic tests 

® At a late stage of testing, some contamination from frozen gas occurred in two of the 
gyros for reasons that are well understood. The gas disappeared on warming to room 
temperature, leaving the gyroscopes once more free of contamination. Pr ocedures 
are in place for preventing this contamination happening again. 
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became possible which further clarified the problems and various experiments 
and rehearsal tests were performed to validate the redesign. Rework commenced 
on March 27 and was completed on June 2, 2000. Reinsertion of Probe-C into 
the SMD will take place before October 18, 2000. 

A. Probe-C Thermal Anomaly. To understand the thermal anomaly one 
needs to grasp the principles governing the design of long hold-time helium 
dewars. 

Liquid helium, because of its quantum mechanical zero point energy, has 
very low latent heat. Far more cooling-power is available from the warming 
of the boil-off gas to room temperature than from the latent heat. In de- 
signing helium dewars one makes use of this fact by intercepting much of the 
incoming heat in ‘vapor-cooled shields’. With 3 to 4 suitably spaced shields 
embedded in the multilayer insulation, it is possible to recover about 35% 
of the available gaseous refrigeration and increase the dewa r hold-time by 
a factor of about 25. 

In Gravity Probe B this vapor-refrigeration must be applied not only to the 
dewar but also to intercept heat radiated and conducted down the neck- 
tube of the probe. Accordingly, as Fig. 12 shows, radiation windows at 3 
locations in the neck are connected to copper rings epoxied to its inner 
surface. Matching copper rings on the outside of the neck connect these 
thermally to corresponding cooled rings on the dewar neck. On orbit, where 
the dewar skin temperature is 220 K, the nominal temperatures of the 3 
w indows are 33 K, 77 K, and 132 K; a fourth ring (HEX ^ 4) without 
a window operates at 157 K. On Earth with a skin temperature of 293 K, 
the window temperatures are usually slightly higher; however, there is also 
a special operating condition used mainly on the launch pad, in which the 
lowest vapor-cooled shield is artificially cooled by means of a ‘guard tank’ 
full of ‘normal’ (i.e. non-superfluid helium) and therefore to operates at or 
near 4.2 K. In the experiments to be discussed now, this was in fac t the 
condition and window ^ 1 should have been at around 6 K. 

This sophisticated design was prototyped in Probes A and B, and success- 
fully demonstrated in both the EDD and the Science Mission Dewar. When 
Probe-C was cooled down, however, very serious temperature anomalies were 
observed. Instead of being at 6 K, as it should have been with the guard-tank 
cold, window ^ 1 ran at 86.5 K! The temperatures of windows # 2 and # 3 
and of Hex # 4 were also far too high. These drastic discrepancies strongly 
suggested that the windows were receiving effectively no cooling from the 
boil-off gas (in other words that there was a thermal disconnect somewhere 
between the window and the vapor-cooled shields. A variety of tests pointed 
to a thermal disconnect internal to Probe-C, probably between the copper 
rings and the fiberglass-epoxy neck tube. Conclusive evidence came from 
tests in which helium exchange gas was introduced into the probe at vari- 
ous pressures with the system vertical to prevent convection. At 0.31 torr 
the temperatures are reduced dramatically: the exchange g as provides an 
excellent conductive path between the inner ring and the neck tube. 
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Fig. 12. Schematic of Probe-C Repair: The three radiation windows can be seen within the neck in the main drawing. The left inset 
shows details of the window frame; the right inset is an enlarged section of the neck-wall and rings. 
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Location 


Expected 

(with guard-tank cold) 


Observed 
(high vacuum) 


Observed 
(0.31 torr) 


Window ^ 1 


6 K 


86.5 K 


26 K 


Window ^ 2 


22.5 K 


122.2 K 


31.5 K 


Window # 3 


72.3 K 


171.3 K 


77.3 K 


Hex ^ 4 


151.4 K 


196.3 K 


166.4 K 



The conclusion has to be that the epoxying that had worked so well for 
Probes A and B went wrong in Probe C. The very detailed written and 
photographic QA record of the construction showed no discernible cause of 
failure; nevertheless, failure there was, with three very serious implications. 
First, the loss of thermal conductivity reduces the dewar hold-time on orbit 
from 19 months to possibly as low as 10 months. Second, just as serious, 
it reduces the ground hold-time in such a way as to make operations on 
the launch pad almost impossibly difficult. Third and gravest of all, the 
break between heat stations and neck tube makes the probe vulnerable to 
mechanical failure under the vibration of launch. 

A failure of this kind puts a space program into a new situation. To disas- 
semble Probe-C and remake the epoxy bonds would take over a year and still 
not guarantee success. Instead, the method GP-B has adopted is to short- 
circuit the problem by a redesign which establishes direct thermal links be- 
tween the outer and inner copper rings. At each heat station four holes are 
drilled through the outer ring, the neck tube, and the inner ring ending as 
a ‘blind hole’ that stops just short of penetrating through to the far side 
of the inner ring. Thermal contact and mechanical integrity are established 
by means of copper pins inserted into each hole with a tight ‘press-fit’. For 
technical reasons two of the pins are inserted horizontally and two inclined 
at 51 degrees. Fig. 13 illustrates a cross-section of the pin and ‘blind hole’ for 
one of the 51-degree pins. Vacuum integrity, which has of course been lost 
by drilling through the neck tube, is restored by two stages of epoxy seal. A 
low viscosity epoxy is injecte d into the hole around the pin and over this, 
following a standard cryogenic practice, a ‘doubler’ in the form either of a 
flat copper sheet or copper plug is sealed over each pin with an extremely 
robust thermally matched epoxy rejoicing in the name “Blue Death” . 

To assure success, rehearsals were performed on two prototype assemblies 
with two alternative pinning procedures. Critical to the pinning solution is 
the complex drilling process which has to be undertaken to penetrate first 
a copper ring, then the composite neck tube, then the second ring, all with 
perfect alignment and without penetrating beyond the blind end of the hole. 
Since the pin has to be a ‘press-fit’, alignment and finish of the holes are 
critical: it has been found necessary to drill each hole in six stages, with three 
different kinds of drill. For the horizontal holes two of the stages required 
carbide drills which are notoriously brittle; great care had to be applied 
to avoid breaking off a drill partway through a hole. After many trials the 
procedure was perfected and demonstrated to meet the three essential re- 
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Fig. 13. Cross Section of Model of Probe Necktube with 510 Copper Pin. The section 
shows the pin in position with the blind hole in the inner copper ring. Not shown is 
the ‘doubler’ which completes the vacuum seal on the outside of the outer ring. 



quirements, thermal, mechanical, and vacuum. Comparative experiments on 
Probes B and C provided an unequivocal room-temperature diagnostic to 
distinguish acceptable and unacceptable the rmal performance at low tem- 
perature. Mechanical and vacuum integrity were validated as follows: Each 
prototype assembly was vacuum tested; subjected to a vibration test at flight 
qualification levels; vacuum tested again; twice cycled to low temperatures; 
vacuum tested for a third time. Both pinning techniques met all require- 
ments. Finally, on March 23, 2000 Probe C Redesign Readiness Review was 
held and approval given to proceed with the rework using the particular 
method discussed above. 

In applying to Probe-C the experience gained during the rehearsals, it was 
essential to take into consideration the issue of cleanliness. Debris or turnings 
from the drilling process must, at all costs, be prevented from contaminating 
the gyroscopes or soiling the surfaces of the telescope and radiation windows. 
The drilling was performed in an ultra high quality (class 10) clean room at 
Stanford, with the probe in an inverted position and at a slight overpressure 
to drive any contamination outwards. It was accompanied by a continuous 
vacuum cleaning around the hole. Drilling proceeded from the outside in, 
removing one radiation window after another to gain access for measurement 
of the position and alignment of the inner rings. 

In fact, the entire operation went remarkably smoothly. Pinning was com- 
pleted on May 15, 2000. After cleaning and reinstallation of the windows the 
following tests were performed by June 2: (1) a static load test on window 
frame ^ 3 at flight qualification levels; (2) the room temperature thermal 
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performance test mentioned above; (3) a vacuum integrity test of the entire 
probe. Prior to the installation of the pins the thermal conductance between 
inner and outer rings was 0.028 W/K, after pinning it was 0. 42 W/K. This 
factor of 15 in improvement was in excellent agreement with the data ob- 
tained with the earlier validation units. The computed helium hold-time on 
orbit (including a 30% allowance for conservatism) is 17 months. Vacuum 
performance was excellent: there was no detectable leak even on the most 
sensitive 3 x 10“^° standard cm^/sec scale of the leak detector, a margin of 
at least 4 orders of magnitude on the GP-B requirement. 

As of mid-June 2000 Probe C is being prepared for room-temperature spin 
tests of the gyroscopes to check whether cleanliness has been properly main- 
tained. 

B. Gyro # 4 Readout Ring Discrepancy. Beside offering a good oppor- 
tunity for scientific detective work, the readout failure for Gyro ^ 4 nicely 
illustrates the range of issues that need to be weighed in executing a space 
program. 

In the initial planning of Gravity Probe B the decision to have four gyro- 
scopes was in some degree arbitrary (a tradeoff between redundancy and 
complexity). Obviously, there ought to be more than one, but since each 
gyroscope will measure both relativity effects a case can be made for flying 
with only three. If there had been no other reason for removing the probe 
from the dewar that would almost certainly have been the decision. However, 
given the fact that the probe did have to be removed the option of re pair 
becomes worth careful consideration. 

The tests demonstrated that the entire SQUID readout unit was function- 
ing perfectly; the fault lay in the failure of a superconducting bond on the 
housing itself. The location of the high-resistance connection in the gyro-to- 
SQUID input circuit was determined by measuring the frequency dependence 
of the SQUID response to high frequency inductively coupled ac signals. Fre- 
quencies above 100 kHz are outside the SQUID feedback circuit bandwidth, 
so these measurements were made open loop (i.e., without feedba ck to the 
SQUID). To obtain consistent measurements, the SQUID was dc biased to 
the most linear portion of its characteristic curve. Input signals were limited 
to no more than one tenth of a SQUID flux quantum in order to obtain a 
reasonably linear output. The resulting data of SQUID output amplitude 
versus frequency were compared to the predictions obtained from SPIGE 
circuit simulation results based on a circuit model with a resistive contact at 
the gyroscope and a second model with the resistive contact in the SQUID 
assembly. The experimental data fit the former model excellently and the 
latter model not at all. 

It is worth emphasizing that the failure of the superconducting bond on gyro 
^ 4 was not entirely unexpected. During earlier qualifications of gyroscopes 
it had been found that the original design was somewhat unreliable and 
occasionally failed on initial cool-down. Accordingly, a new more reliable 
design of bond was developed, which has shown no failures, and this was 
used for gyros ^ # 1, 2, and 3. For gyro # 4 a compromise was made. A 
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gyroscope was available which was so excellent in other respects that we 
decided to use it despite having a bond of the original design, the argument 
being that in the past, once such bonds were qualified, they always withstood 
repeated temperature cycling. 

Repairing the bond simply means replacing the gyroscope. Although the SIA 
is designed to allow such a substitution, the process has never actually been 
tried and does entail risk. The cables and connections to the old gyroscope 
have to be severed and unstaked without damaging or contaminating the 
assembly; the gyro has to be removed and replaced; and numerous tests have 
to be performed prior to resealing the probe. Before finalizing the decision, 
a rework team has been set up to rehearse the process in a ba ckup Quartz 
Block Assembly containing two gyroscopes left over from earlier prototyping 
activities. The results are encouraging; a decision will be taken in June 2000. 

C. High Trapped Flux in Gyro # 1. The 10.6x 10“^° T trapped field in gyro 
# 1 was rather a surprise because in Probe-B, where less attention had been 
paid to magnetic cleanliness, there had been little difficulty in obtaining 
3 X 10“^° T trapped fields in all four gyros. Three possible explanations 
suggest themselves: (1) a change in magnetic field level in the lead bag; (2) 
magnetic contamination in the probe; (3) a thermal effect connected with 
the Probe-C anomaly. 

The first conjecture was that there had been a field change in the lead 
bag. The reason for suspecting this was that during cryogenic operation a 
‘thermal spike’ had occurred which could have heated the top end of the bag 
above its superconducting transition temperature. However, after removing 
Probe-C from the dewar and carefully remeasuring the field at all levels we 
found the field completely unchanged. Magnetic contamination remains a 
possibility, though in view of the extreme care taken in material screening it 
seems unlikely. The most probable explanation is a temperature effect. 

The hypothesized source is the Thomson effect. As is well-known, Thomson 
in the 1850s showed from thermodynamical reasoning that when any metal is 
subjected to a temperature gradient, a voltage difference will be established 
across it. What is less well-known is Thomson’s second discovery (1895) 
that if the metal is anisotropic circulating currents can be set up generating 
magnetic fields. At low temperatures, this effect becomes much larger as 
was discovered by a GP-B graduate student, P.M. Selzer, in 19 74. The 
argument, therefore, is that the thermal anomaly in Probe-C produces just 
such a temperature gradient in the metal around the SIA, at its upper end, 
and that the most likely cause of the anomalous field in Gyro # 1 is this 
Thomson-Selzer effect. Final verification will only be possible after insertion 
of the reworked probe into the SMD. 

Two possibilities therefore confront us. First that on cool-down the anomaly 
will have disappeared. Second that for magnetic contamination or some other 
reason the trapped field in Gyro ^ 1 will still be anomalously high, in which 
case the question is, how problematical that will be for the experiment. 
Detailed analysis by J.M. Lockhart demonstrates that while the 3 x 10“^° T 
nominal requirement remains a desirable goal, the readout is actually more 
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‘forgiving’ than had originally been thought, and there will be little or no 
loss in final accuracy of the measurement. 

D. Leaks in Caging Lines, etc. During the course of the reworking of Probe- 
C in the clean room at Stanford repairs were effected to the caging lines and 
these were further secured by the addition of ‘doublers’ similar in principle 
to the doublers on the pinning system. 

8 Spacecraft, Electronics Systems, Integration &; Test 

Reflection about Gravity Probe B rightly concentrates first on the Science In- 
strument with its cryogenic and mechanical refinements but no account of a 
space mission can end there. The following is a brief summary of the main other 
aspects of the mission. 

8.1 Spacecraft 

Gravity Probe B is unusual in that its main structural element is the Science 
Mission Dewar. To it is attached the Spacecraft, a welded aluminum framework 
that fits around the lower end of the SMD and carries the solar panels, harnesses, 
spacecraft electronics, and other support equipment Fig. 14. 

Mounted on the front end of the SMD in close proximity to Probe-G is a 
Forward Electronics Enclosure (FEE), which provides a highly stable thermal 
environment with extensive shielding against electromagnetic interference and 
coupling for the key payload electronics boxes. Also mounted on the SMD is 
the telescope sunshield. The combined weight of spacecraft and payload is 3241 
kg. The power requirements are 293 W for the spacecraft and 313 W for the 
payload, (606 W total) during normal operation; during initial setup, however, 
the requirement rises at certain intervals to 713 W. For an inertially fixed rolling 
spacecraft such as GP-B the power output from the sola r panels at any moment 
depends on their geometry and the orientation of the spacecraft with respect to 
the Sun. Allowance has to be made for degradation over the course of the mission. 
The system, as designed, has beginning-of-life and end~oDlife capabilities at 
worst-case seasonal minimum of 731 W and 703 W respectively. At the currently 
scheduled May 1, 2002 launch-date the actual capability is 873 W at launch 
declining to 737 W in early June, a sufficient margin over the 713 W initial 
requirement. 

As of June 2000 the spacecraft is 85% complete. Most of the subsystems, me- 
chanical and electrical, are flight-proven. In total there are 21 electronics boxes 
representing 12 distinct electrical systems all (except for Attitude/Translational 
Gontrol) flight-proven or close derivatives of flight-proven systems. All have 
been installed on the Spacecraft. All electrical harnesses are complete. Two and 
a half of the four solar panels are complete. The spacecraft flight software coding 
is complete and in veri fication test. 

Straightforward as most aspects of the GP-B spacecraft are, two of the me- 
chanical systems essential to the success of the relativity measurement are de- 
cidedly unusual. One is the use of helium proportional thrusters for attitude and 
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Fig. 14. The GP-B Spacecraft: The spacecraft fits around the lower end of the SMD. 
Among the items visible in the photo are two long rectangular boxes containing two of 
the seven mass-trim mechanisms. 



translational control; the other is having a set of 7 mass-trim mechanisms to bal- 
ance and align the principal axes of the spacecraft. Each mass trim mechanism 
consists of a 20 kg weight mounted in a closed rectangular box and adjusted in 
position by a lead screw driven by a ste pper motor. These mechanisms have 
been subject to a very extensive qualification program and are now installed on 
the spacecraft; 4 are mounted transversely to move the axis of rotation laterally 
until it coincides with the line through the center of the gyroscopes to within 0.8 
mm. The other 3 are parallel to the spacecraft axis and adjust the direction of 
the principal axes of inertia. Trimming is performed intermittently as the helium 
is depleted, probably a few times during the course of a year. 

In most spacecraft with gas-jet attitude control systems constraints on gas 
consumption set by weight dictate the use of on-off valves fired only on de- 
mand. To apply this method to GP-B with its very fine pointing requirement 

±20 mas) would take a space-qualified valve capable of reseating perfectly 
hundreds of millions of times, a severe reliability problem. Fortunately, there is a 
way. Already on board is a supply of gas that must be vented - the helium boil-off 
from the dewar. By directing this continuously through pairs of opposed nozzles 
operated as ‘proportional thrusters’ one obtains a control system that is at once 
smoother and mechanically more reliable than the conventional kind. A striking 
feature is the low Reynolds’ number 30) in the throat of the valve. Work by 
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a succession of GP-B graduate students, J.S. Bull, J.-H. Chen, P. Wiktor, and 
Y. Jafry, has led to a design subsequently further developed and space-qualified 
by Lockheed Martin that meets all of GP-B’s r equirements, provided the orbit 
altitude exceeds about 600 km. Fig. 15 illustrates the flight thrusters. 

Control is maintained in all three 
modes, pointing, drag-free, and roll, 
from signals derived respectively from: 

(1) the science telescope, (2) either of 
two science gyroscopes operated drag- 
free, (3) conventional rate-gyroscopes 
mounted on the spacecraft, updated 
by signals from a ‘star-blipper’ pick- 
ing up a band of stars spread over the 
heavens at an angle to the roll axis. 

The pointing accuracy good to about 
10 mas, the residual cross-track aver- 
age acceleration to better than 10“^^ 
g, the roll- -rate to about 1 part in 
10®. The translational controller is de- 
signed to force the center of rotation 
of the Spacecraft into coincidence with 
the line through the gyro centers. The mass-trim mechanism brings it within 
0.8 mm; the translational controller brings it to 50 /rm, that is, to within the 
limit to which the gyros are aligned. 

GP-B has 16 thrusters mounted on fixed struts extending out from the Space- 
craft, with geometrical and internal redundancies such that any four systems can 
be allowed to fail with no loss of control performance. 

8.2 Payload Electronics 

In contrast to the spacecraft electronics, based on flight-proven hardware, almost 
all of the payload electronics are new. Altogether there are 5 systems, containing 
a total of 15 electronics boxes distributed between the FEE and a corresponding 
Aft Electronics Enclosure (AEE) on the Spacecraft (the reason for this division 
between forward and aft enclosures is that if all the boxes were mounted forward 
it would unbalance the mass distribution of the space vehicle): 

• Telescope Readout Electronics (TRE), 2 boxes, both forward; 

• SQUID Readout Electronics (SRE), 4 boxes, 2 forward, 2 aft; 

• Gyro Suspension System (GSS), 8 boxes, 4 forward, 4 aft; 

• Experiment Gontrol Unit (EGU), 2 boxes, 1 forward, 1 aft; 

• Ultraviolet Discharge (UVD), 2 systems, 1 in each aft EGU box. 

Only the UVD has been flown before. 

An important lesson for the physicist in the world of space electronics is 
that even when an engineering model of a flight unit has met all requirements 




Fig. 15. Proportional Helium Thrusters 
(Flight Unit). 
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there is still a long way to go before the actual flight unit is ready for installa- 
tion. Procuring the right space-hardened components, assembling the flight unit 
under full Quality Assurance, and completing all the tests that are necessary, 
including vibration tests, is a major task. Among several common surprises is 
the extraordinarily long procurement t imes for ‘mil-standard’ components. For 
GP-B, engineering models of each unit have been fully tested in the laboratory 
but as of June 2000 the only flight unit available is the TRE. For the other three 
all components are in-house; the state of assembly is as follows. (1) SRE: all 
16 boards complete. Aft-box passed all functional tests except vibration test; 
forward-box in functional testing. (2) GSS: 10 of 14 boards assembled. For aft- 
box, all 5 boards tested, conformally coded, and ready for board int egration. For 
forward-box, 4 of 9 assembled, 2 completed testing, 1 also conformally coded. 
(3) EGU: all 19 boards assembled, tested, and conformally coded. Box testing 
ready to begin. 

The most challenging to design of the Payload Electronics systems is the 
GSS. It has to support the gyroscope with very low voltage (about 0.1 V) in 
order not to generate suspension torques yet be able to switch instantly to a 
higher level in emergencies. During gyro spin-up it has to exert an acceleration 
of approximately 0.3 g on the rotor to balance the pressure from the spin-up gas. 
Also during spin-up the center of the rotor has to be displaced in a controlled 
manner toward the spin-up channel to r educe gas leakage over the wall of the 
channel. All these operations have to be performed automatically without ever 
allowing the spinning ball to touch the housing. The final design is a digital 
system with two separate computers and an analog back-up, with an arbiter to 
decide which of the three parallel systems should take command. Testing it on 
Earth has been an interesting task. In addition to high voltage operation with 
live gyroscopes, a gyro test bed has been constructed in which variable flat plat 
e capacitors driven by piazoelectric actuators mimic the actual gyroscope under 
controlled conditions. 

The current schedule has the GSS completed by December 2000. The GSS 
and SRE will be used in Probe-G evaluation on the SMD at Stanford early 
in 2001 prior to shipment of the completed Payload to Lockheed Martin for 
integration with the Spacecraft in April 2001. 

8.3 Integration & Test, Ground Station, and Launch 

The process of testing a completed payload and spacecraft is in general terms 
well-established, though logistically it involves complications that a physicist 
might hardly expect. For Gravity Probe B, in addition to elaborate acoustic, 
modal, and end-to-end thermal vacuum tests, very careful spin balancing of the 
spacecraft is necessary to bring it within the range of the mass-trim mechanisms. 

The most unconventional aspect of the GP-B test program is the use of an 
Integrated Test Facility (ITF) to check software and command/ telemetry signals 
at an early stage prior to transmitting new untested signals to the Spacecraft. 
From the software point of view GP-B is an unusually complex spacecraft with 
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7 separate on-board computers, as well as an unusually large number of elec- 
tronics boxes. To mitigate the difficulty the ITF was commenced very early in 
the program and brought close to completion in 19 98, though even now not all 
the hardware is installed for verifying performance of the 1553 computer and 
command and telemetry signals. A constant difficulty in any test program is to 
separate problems in the electronics from problems in the Test Facility itself. 
A method of alleviating this has been devised. It consists of mounting in the 
ITF the already tested engineering models of the different electronics systems, 
so that it is possible to switch back and forth between them and the flight units. 

No less challenging has been the development at Stanford of the Mission Op- 
eration Center (MOC) and ground control software. The MOC communicates 
through NASA Wallops Flight Facility’s Ground Network to a series of ground 
stations located around the Earth. The primary stations in the Ground Net- 
work are the Alaska Ground Station and the Svalbard, Norway Ground Station. 
During set-up and, if necessary, emergency the Ground Network can be sup- 
plemented by the NASA Tracking and Relay Satellite System (TRSS) Spac e 
Network. And, finally, there is all the planning for the launch. The elaborate 
preparations required, including safety, preparation of ground support equip- 
ment, training of personnel, and the transportation of the spacecraft to the 
launch site, are a whole story to itself. 
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Abstract. The main theoretical aspects of gravitomagnetism are reviewed. It is shown 
that the gravitomagnetic precession of a gyroscope is intimately connected with the 
special temporal structure around a rotating mass that is revealed by the gravitomag- 
netic clock effect. This remarkable effect, which involves the difference in the proper 
periods of a standard clock in prograde and retrograde circular geodesic orbits around 
a rotating mass, is discussed in detail. The implications of this effect for the notion of 
“inertial dragging” in the general theory of relativity are presented. The theory of the 
clock effect is developed within the PPN framework and the possibility of measuring 
it via spaceborne clocks is examined. 



1 Introduction 

The close formal similarity between Coulomb’s law of electricity and Newton’s 
law of gravitation has led to a description of Newtonian gravitation in terms of a 
gravitoelectric field. The classical tests of general relativity can all be described 
via post-Newtonian gravitoelectric corrections brought about by relativity the- 
ory. Moreover, any theory that combines Newtonian gravitation and Lorentz 
invariance in a consistent framework must involve a gravitomagnetic field in 
close analogy with electrodynamics. The gravitomagnetic field is generated by 
the motion of matter. For instance, the mass current in the rotating Earth gen- 
erates a dipolar gravitomagnetic field that has not yet been directly observed; 
in fact, the main objective of the GP-B is to measure this field in a polar Earth 
orbit via the gravitomagnetic precession of superconducting gyroscopes on board 
a drag-free satellite. 

Gravitomagnetism had its beginning in the second half of the last century. 
Developments in electrodynamics led Holzmiiller [1] and Tisserand [2] to pos- 
tulate the existence of a solar gravitomagnetic field [3]. In fact, attempts were 
made to account for the excess perihelion precession of Mercury since the plan- 
etary orbits would be affected by the gravitomagnetic field of the Sun. However, 
the excess perihelion precession of Mercury was successfully explained by Ein- 
stein’s general relativity theory in terms of a small relativistic correction to the 
Newtonian gravitoelectric potential of the Sun. It was later shown by Thirring 
and Lense [4,5] that general relativity also predicts a certain gravitomagnetic 
field for a rotating mass, but the magnitude of this field in the solar system is 
generally small and would lead to a retrograde precession of the planetary orbits. 
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This Lense-Thirring precession of planetary orbits is too small to be detectable 
at present. 

For the purposes of confronting the theory with observation, gravitomag- 
netic phenomena are usually described in the framework of the post-Newtonian 
approximation; however, it is possible to provide a fully covariant treatment of 
certain aspects of gravitoelectromagnetism [6,7]. In fact, extensions of the Ja- 
cobi equation (i.e. the relativistic tidal equation) may be employed to identify 
the gravitoelectric and gravitomagnetic components of the curvature tensor in 
close analogy with the Lorentz force law. This analogy is incomplete, however, 
since the purely spatial components of the curvature tensor do not in general 
have an analog in the electromagnetic case; in fact, this is expected since linear 
gravity is a spin-2 field in contrast to the spin-1 character of the electromagnetic 
field. 

Some of the main theoretical aspects of gravitomagnetism are discussed in 
Section 2. We then turn our attention to how gravitomagnetism affects the 
spacetime structure in general relativity. Of primary importance in this con- 
nection is the gravitomagnetic clock effect, which in its simplest form may 
be formulated in terms of the difference in the proper periods of two clocks 
moving on the same circular orbit but in opposite directions about a rotat- 
ing mass. Let t+(t_) be the period for prograde (retrograde) motion, then for 
r ^ 2GM/c^,t+ — t_ « A'kJ/{Mc^). To lowest order, this remarkable result is 
independent of Newton’s constant of gravitation G and the radius of the orbit 
r. The effect and its consequences are discussed in Section 3 for circular equato- 
rial orbits in the Kerr geometry and the intimate connection between the clock 
effect and the gravitomagnetic gyroscope precession is demonstrated. The PPN 
approximation for this effect is developed in Section 4 and a brief discussion of 
its observability is given in Section 5. The sign of the clock effect is quite in- 
triguing, as it implies that prograde equatorial clocks are slower than retrograde 
equatorial clocks. This is completely opposite to what would be expected on the 
basis of “inertial dragging”. In fact, gravitomagnetism is historically connected 
with the question of the origin of inertia as this was Thirring’s motivation in 
his original paper on gravitomagnetism [4] . The present status of the problem of 
inertia is the subject of Section 6. Finally, Section 7 contains a brief discussion. 

Unless specified otherwise, we use units such that G = c = 1 for the sake of 
convenience. 

2 Gravitoelectromagnetism 

This section is devoted to a brief discussion of certain essential theoretical aspects 
of gravitoelectromagnetism. The Larmor theorem has played an important role 
in the field of magnetism; therefore, we begin by an account of the gravitational 
analog of Larmor’s theorem. 
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2.1 Gravitational Larmor Theorem 

A century ago, Larmor established a theorem regarding the local equivalence 
of magnetism and rotation [8]. That is, the basic electromagnetic force on a 
slowly moving particle of charge q and mass m can be locally replaced in the 
linear approximation by the inertial forces that arise if the motion is referred 
instead to an accelerated system in the absence of the electromagnetic field. 
The translational acceleration of the system is related to the electric field, = 
— (g/m)E, and the rotational (Larmor) frequency is related to the magnetic 
field via u>l = q^/{2mc). The charge-to-mass ratio is not the same for all 
particles; otherwise, a geometric theory of electrodynamics could be developed 
along the same lines as general relativity. It turns out that in general relativity 
one can provide an interpretation of Einstein’s heuristic principle of equivalence 
via the gravitational Larmor theorem [9] . This is due to the experimentally well- 
established circumstance that the gravitational charge-to-mass ratio is the same 
for all particles. Einstein’s heuristic principle of equivalence is usually stated 
in terms of the gravitoelectric field, i.e. the translational acceleration of the 
“Einstein elevator” in Minkowski spacetime. The gravitational Larmor theorem 
would also involve the gravitomagnetic field, i.e. a rotation of the elevator as 
well. 

It follows from the theoretical study of the motion of test particles as well 
as ideal test gyroscopes in a gravitational field that in general relativity the 
gravitoelectric charge is qE = —m, while the gravitomagnetic charge is qs = 
—2m; in fact, qs/qE = 2 since general relativity involves the tensor potential 
5^1/, i.e. (linear) gravitation is a spin-2 field. Thus = E and ujl = —^jc in 
this case. Indeed '&jc= flp is the gravitomagnetic precession frequency of an 
ideal test gyroscope at rest in a gravitomagnetic field, i.e. far from a rotating 
source dS/dt = flp x S, where 

C T 

( 1 ) 

and J is the total angular momentum of the source. Let us note that a gyro spin 
is in effect a gravitomagnetic dipole moment that precesses in a gravitomagnetic 
field. Locally, the same rotation would be observed in the absence of the gravit- 
omagnetic field but in a frame rotating with frequency ivp = — flp in agreement 
with the gravitational Larmor theorem. 

It is the goal of the GP-B to measure the gravitomagnetic gyroscope preces- 
sion in a polar orbit about the Earth and thereby provide direct observational 
proof of the existence of the gravitomagnetic field [10]. 

2.2 Gravitoelectromagnetic Field 

Let us consider the gravitational field of a “nonrelativistic” rotating astronomical 
source in the linear approximation of general relativity. The spacetime metric 
may be expressed as where iq^i, is the Minkowski metric. We 
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define h^i, = h^i, — where h = then, the gravitational field 

equations are given by 

7tG 

□^1/ = (2) 

where the Lorentz gauge condition h ^'' = 0 has been imposed. We focus atten- 
tion on the particular retarded solution of the field equations given by 



h - 

— ^4 



Tf^ui.ct - |x - : 



i>yj. 



( 3 ) 



where the nature and distribution of the “nonrelativistic” source must be taken 
into account. 

We are interested in sources such that /iqo = hot = —2Ai!(? and 

hij = where ^(t,x) is the gravitoelectric potential, A(t,x) is the grav- 

itomagnetic vector potential and we neglect all terms of order and lower 
including the tensor potential hij{t, x). It follows that j(? = p is the effective 
gravitational charge density and /c = j* is the corresponding current. Thus, 
far from the source 






GM 
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A 



G J X r 

c ’ 



( 4 ) 



where M and J are the total mass and angular momentum of the source, respec- 
tively. It follows from the Lorentz gauge condition that 



1 

c dt 



+ V- 




= 0 , 



( 5 ) 



since the other three equations = 0) all involve terms that are of 0{c~‘^) 

and therefore neglected. The spacetime metric involving the gravitoelectromag- 
netic (“GEM”) potentials is then given by 

—ds^ = —c^ ^1 — dt^ — - (A • dx)(it -|- ^1 -I- Sijdx^dxG (6) 



The GEM fields are defined by 



in close analogy with electrodynamics. It follows from the field equations (2) and 
the gauge condition (5) that 
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which are the Maxwell equations for the GEM field. Using classical electrody- 
namics as a guide, one can investigate the various implications of these equations 
[11]. A thorough approach to the determination of the gravitomagnetic field of a 
rotating mass (such as the Earth) is contained in the papers of Teyssandier [12]. 

The fact that the magnetic parts of equations (8) - (11) always appear with a 
factor of 1/2 as compared to standard electrodynamics is due to the circumstance 
that the effective gravitomagnetic charge is twice the gravitoelectric charge. That 
is, Qe = M and Qb = 2M are the effective gravitoelectric and gravitomagnetic 
charges of the source. 

The linear approximation of general relativity involves a spin-2 field. This 
field, once its spatial components are neglected, can be interpreted in terms of 
a gravitoelectromagnetic vector potential. To sustain the electromagnetic anal- 
ogy, however, we need to require that the gravitomagnetic charge be twice the 
gravitoelectric charge. This factor of 2 is a remnant of the spin-2 character of 
the original field, while for a pure spin-1 field (i.e. the electromagnetic field) the 
ratio of the magnetic charge to the electric charge is unity. 

The equation of motion of a test particle of mass m in this linear gravitational 
field can be obtained from the variational principle S f Cdt = 0, where C = 
—mcds/dt is given by 



C = -7 



l-^-^ll 

C2 c2 ' 



V \ ^ 4 



-I 1/2 



using equation (6). To linear order in (p and A, one can write (12) as 



C = —T 



1 - 



1/2 



-I- mj 1 -I- 



^ 2 to 

<P yv • A. 



(12) 



(13) 



Let us note that the deviation of equation (13) from a free-particle Lagrangian 
is given to lowest order in v/c by — 2mA • v/c. This deviation would be 
of the form in electrodynamics; therefore, the slow motion of the test 

particle is very similar to that of a charged particle in electrodynamics except 
that here qe = —m and qb = —2m as expected. It thus follows from the geodesic 
motion of a test particle of mass m far from the source in this gravitational 
background that the canonical momentum of the particle is given approximately 
by P + (— 2m/c)A, where p is the kinetic momentum. In this electrodynamic 
analogy, the attractive nature of gravity is reflected in our convention of positive 
gravitational charges for the source and negative gravitational charges for the test 
particle. The gravitomagnetic charge is always twice the gravitoelectric charge as 
a consequence of the tensorial character of the gravitational potentials in general 
relativity. 

The gauge transformations 

A^A-k2VV', (14) 

leave the GEM fields (7) and hence the GEM equations (8)-(ll) invariant. The 
Lorentz gauge condition (5) is also satisfied provided = 0. However, the 
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quantity —qe^ + 9 bA • v/c in the Lagrangian is not invariant under the gauge 
transformation (14). The gauge invariance of this Lagrangian is restored, how- 
ever, if the gauge function if) is independent of time, dip/dt = 0. In this case, we 
can start from a coordinate transformation t ^ t — 4t/’(x)/c^ in the metric (6) 
resulting in the gauge transformations (14) with <1> left invariant. 

The gravitational field corresponding to the metric (6) is given by the Rie- 
mann curvature tensor 






(15) 



where hoo = 2^/c^ and htj = (2<?/c^)<5ij are gravitoelectric and of 0(c“^), 
while hoi = —2Aij(? is gravitomagnetic and of 0{c~^). The components of the 
curvature tensor as measured by the standard geodesic observers are given by 
^P’'P<y^(a)^{f 3 )^{'y)^'(S)’ where is the tetrad frame of the test observer. In the 
linear approximation under consideration here, is in effect equal to in 
the calculation of the measured curvature. The components of this tensor may 
be expressed in the form of a symmetric 6x6 matrix TZ = {'1Zab)i where A and 
B range over (01, 02, 03, 23, 31, 12); hence. 



n = 




(16) 



where £ and S are symmetric 3x3 matrices and B is traceless. We find that the 
electric and magnetic components of the curvature are given by 

£ij = ^^j,i + 0{c ^), 

and the spatial components are given by 

Sij = --^Ej^i+ -^{V ■'E)6ij + 0{c "‘). (19) 

That B is traceless is consistent with equation (9) and the fact that £ and 
S are symmetric is consistent with equation (10) at 0(c“^). It is therefore clear 
that gravitoelectromagnetism permeates every aspect of general relativity: the 
gravitational potentials (GEM potentials), the connection (GEM field) and the 
curvature. In the exterior of the rotating source, the spacetime is Ricci-flat and 
hence S = —£, £ is traceless and B is symmetric. These restrictions on the 
curvature are consistent with the GEM field equations (8)-(ll) in the source- 
free region. 

The general treatment of gravitoelectromagnetism presented here has been 
based on a certain approximate form of the linear gravitation theory and can 
be used in the theoretical description of many interesting gravitational phenom- 
ena. In particular, we use this formalism below to investigate the microphysical 
implications of the gravitomagnetic precession of spin. 



(17) 

(18) 
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2.3 Free Fall Is Not Universal 

The assumption that all free test particles fall in the same way in a gravitational 
field is reflected in general relativity via the geodesic hypothesis. That is, the 
worldline of a free test particle is an intrinsic property of the spacetime manifold 
and is independent of the intrinsic aspects of the particle. In this way, general 
relativity is a geometric theory of gravitation. This circumstance originates from 
the well-tested equality of inertial and gravitational masses. 

An important consequence of Einstein’s geometric theory of gravitation is 
the fact that an ideal test gyroscope would process in the gravitomagnetic held 
of a rotating source. Here we pose the question of whether all spins should 
“process” like a gyroscope; evidently, the treatment of intrinsic spin would go 
beyond classical general relativity. It follows from the consideration of spin- 
rotation-gravity coupling that the intrinsic spin of a particle (e.g. a nucleus) 
would couple to the gravitomagnetic held of a rotating source (such as the Earth) 
via the interaction Hamiltonian 



n = (Tf2p ( 20 ) 

such that the Heisenberg equations of motion for the spin would be formally 
the same as that of an ideal test gyro [13]. Intuitively, this interaction is due 
to the coupling of the gravitomagnetic dipole moment of the particle with the 
gravitomagnetic held just as would be expected from the electromagnetic anal- 
ogy. It follows from equation (20) that the particle is subject to a gravitational 
Stern-Gerlach force given by 



F = -V{cr ■ Up) (21) 

that is purely dependent upon its spin and not its mass and therefore violates 
the universality of free fall. 

The point is that a particle is in general endowed with mass and spin in addi- 
tion to other intrinsic properties; indeed, the irreducible unitary representations 
of the inhomogeneous Lorentz group are characterized by mass and spin. In its 
interaction with a gravitational held, the mass interacts primarily with the grav- 
itoelectric held while the spin interacts primarily with the gravitomagnetic held. 
Whereas the former dominant interaction is consistent with the universality of 
free fall, the latter is not. For instance, the bending of light by the gravitational 
held of a rotating source depends on the state of polarization of the radiation. 
The differential deflection of polarized radiation by the Sun is too small to be 
measurable at present. The predicted violation is also extremely small for a nu- 
cleus in a laboratory on the Earth: the weight of the particle is ic = ± e), 

depending on whether the spin is polarized vertically up or down and e ~ 10“^®. 
Thus the predicted violation of the universality of free fall is extremely small. 

It may still be possible to measure this relativistic quantum gravitational 
effect by detecting the change in the energy of a particle in the laboratory when 
its spin is flipped. This would require, for instance, significant refinements in 
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modern variations of NMR and optical pumping techniques, since 

is a factor of 10"^ below the sensitivity of recent experiments [14]. Here Lp is the 
Planck length, Lp = hG J is the angular momentum of the Earth and R is 
its average radius. The smallest detected energy shift is about 10“^^eV corre- 
sponding to a frequency shift of 2 nHz [14]. However, it appears that significantly 
lower energy shifts may soon be detectable [15]. 

To clarify the nature of the force (21), let us consider the motion of a classical 
spinning test body in a stationary gravitational field. Such a system is necessarily 
extended and thus couples to spacetime curvature resulting in a Mathisson- 
Papapetrou force 

T’a = (23) 

where is the spin tensor of the system, is the velocity vector such that 
= 0 and the spin vector is given by 

Sn = (24) 

For the calculation of Fa, it suffices to set, in the linear approximation, « 
(1, 0, 0, 0), « 0 and Then, Fq « 0 and 

F, « = -(^2p)p,F^ (25) 

in agreement with equation (21). Thus the existence of the force (21) may be 
ascribed to the intrinsic nonlocality of a particle in the quantum theory and 
hence the coupling of spin to the magnetic part of the spacetime curvature in a 
stationary field. 

It is important to remark here that our considerations are distinct from pro- 
posals to measure the classical spin-spin force as discussed in [11]. Our results ul- 
timately follow from detailed considerations of Dirac-type wave equations in the 
gravitational field of a rotating mass (see the references cited in [13]); however, 
one can arrive at equations (20)-(21) on the basis of certain general arguments 
such as the local isotropy of space, the extended hypothesis of locality and the 
gravitational Larmor theorem [13]. 

Assuming the approximate validity of equations (20)-(21), it would be diffi- 
cult to imagine a basic gravitational theory founded purely on the universality of 
free fall and Riemannian geometry. However, such a theoretical structure would 
clearly be an excellent effective theory in the macrophysical domain. 

2.4 GEM Stress-Energy Tensor 

Let us imagine a congruence of geodesic test particles in a gravitational field. 
Taking one of the test particles as the reference observer, how does the motion of 
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the other neighboring test particles appear to the fiducial observer? The result 
is best expressed in a Fermi coordinate system that is set up along the refer- 
ence worldline. Let = (r, X) be the Fermi coordinates of the test particles, 
while the reference observer is at the origin of spatial Fermi coordinates. Then 
the equation of motion of the test particles is given by the generalized Jacobi 
equation 



^ + ^Ro^oJX^ + 2 + (2 ^RokjoV^V'^ 

+ I ? ^RokjiV^V>^V^)X^ = 0, (26) 

which is valid to first order in the relative separation X and to all orders in the 
relative velocity V = dX/dr. Here ^Rap-^siT) are components of the curvature 
tensor as measured by the fiducial observer, i.e. they are the projections of the 
Riemann tensor onto the nonrotating tetrad frame of the reference observer. 
Neglecting the second and third order terms in the relative rate of separation, 
equation (26) can be written as the GEM analog of the Lorentz force law 



= 9bE-|- geV X B, (27) 

where qs = —m, qb = —2m and 

E, = ^i?o^o,(T)X^ Bi = -^eijk^R3koi{T)XK (28) 

It is important to notice that the spacetime interval in the neighborhood 
of the reference worldline can be expressed in Fermi coordinates as —ds^ = 
^g^i, dX^dX^, where 

^doo = -1 - ^Ro^OJ{T)X^X^ + ■■■ , (29) 

= ~l ^Rojik{T)X^X'^ + • • • , (30) 

= S^j - i ^R,kji(T)X^X^ + ■■■ . (31) 

Letting ^goo = —1 + 2^ and ^goi = —2Ai, we find that 

^Ro^ojX^X^, i (32) 



so that the corresponding GEM fields using equation (7) agree with the results 
in equation (28) to linear order in the separation X. One can verify directly that 
V • B = 0 and V x E = 0, so that the source-free pair of Maxwell’s equations 
are satisfied along the reference worldline. Moreover, it is possible to combine 
the GEM fields together to form a GEM Faraday tensor Fag, 

Fag = — ^RagoiX'', 



(33) 
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such that Fq^ = —E^ and Fij = eijkBj^ as in standard electrodynamics. Then the 
other pair of Maxwell’s equations is given by F°‘^,jj= where is 

easily obtained to linear order in X using equation (33). J“ is a conserved current 
such that Jq(t, 0) = — ^Rao/A-K along the fiducial trajectory. This treatment 
should be compared and contrasted with the linear approximation developed 
above, in particular, the GEM current is different here. 

It is now possible to develop the classical field theory of the GEM field in the 
Fermi frame; in particular, one can define the Maxwell stress-energy tensor and 
the corresponding angular momentum for the GEM field. Thus 

T“^(t, X) = ^ f F^sF'-^^^ (34) 

47T y 4 y 

is the Maxwell stress-energy tensor for the GEM field that is quadratic in the 
spatial separation and vanishes at the location of the fiducial observer. Physi- 
cal measurements do not occur at a point, as already emphasized by Bohr and 
Rosenfeld [16]; moreover, the fiducial observer is arbitrary here. Therefore, a 
physically more meaningful quantity is obtained by averaging equation (34) over 
a sphere of radius tL in the Fermi system. Here L is a constant invariant length- 
scale associated with the gravitational field. We find that 

< ToMgX) > = (35) 

where Cq is a constant numerical factor and 

Tfivpa = 2 4" ~ -^9p,uRa0p^R ■ (36) 

For a Ricci-flat spacetime, Tpi,p„ reduces to the Bel- Robinson tensor T^^p^; 
in this case, RaS-yS reduces to the Weyl tensor CaB-yS and in equation (36) 

= (A74te, with 

The magnitude of Cq depends on whether we average over the surface or the 
volume of the sphere; in any case, one can always absorb Cq into the definition 
of L. Thus the pseudo-local GEM stress-energy tensor may be defined for any 
observer with the tetrad frame A(* , as 

(q) 

In this way, the pseudo-local GEM energy density, Poynting flux and stresses 
are defined up to a common multiplicative factor. 

It is important to note that the spatial components of the curvature have 
been ignored in our construction of the GEM tensor T(^a){/ 3 )- For a Ricci-flat 
spacetime, however, the spatial components of the curvature are simply related 
to its electric components; therefore, the pseudo-local tensor defined via equation 
(37) using the Bel-Robinson tensor contains the full (Weyl) curvature tensor and 
is thus the gravitational stress-energy tensor. 

It follows from a simple application of these results to the field of a rotating 
mass that there exists a steady Poynting flux of gravitational energy in the 
exterior field of a rotating mass. 




Gravitomagnetism and the Clock Effect 



93 



2.5 Oscillations of a Charged Rotating Black Hole 

Imagine a black hole of mass M, charge Q and angular momentum J that is 
perturbed by external radiation. The black hole is stationary and axisymmetric; 
therefore, the perturbation is expressible in terms of eigenmodes V{r, 6) exp(— iwt 
+imj4>), where V depends upon the frequency of the radiation, the total angular 
momentum parameters of the eigenmode {j,rrij) and the spin of the external 
field. It turns out that for a Fourier sum of such eigenmodes, the response of 
the black hole far away and at late times is dominated by a superposition of 
certain damped oscillations of the form Aexp{—iujt), where u) = wbh~*^bh with 
Cbh > 0. For these quasinormal modes, the amplitude A depends, among other 
things, on the strength of the perturbation while w depends only on the black 
hole parameters (M, Q, J). Moreover, these black hole oscillations are in general 
denumerably infinite and are numbered as n = 0,1,2,---, such that n = 0 is 
least damped and /bh increases with n. The intrinsic ringing of a black hole 
is due to the fact that once perturbed, the black hole undergoes characteristic 
damped oscillations in order to return to a stationary state. 

The fundamental modes of oscillations of black holes were originally found 
by numerical experiments and initial attempts to explain the numerical results 
via the properties of black hole effective potentials were unsuccessful [17]. The 
solution of the problem was first given around 1980 [18]. This work provided 
the stimulus for many subsequent investigations by a number of authors [19]. A 
detailed discussion of black hole oscillations is contained in [20]. 

For the modes of oscillation of a charged rotating black hole, the only reliable 
results are for j > \rrij\ ^ 1. Expressions for (wbHjTbh) have been obtained in 
the case of j = \mj\ 3> 1 for a general Kerr-Newman black hole; however, the 
results have been generalized to the case of j > \mj\ » 1 only for a slowly 
rotating charged black hole [21]. To express (wbh, Tbh) in terms of (M, J, Q) in 
the latter case, let wk(t’) = {Mr~^ — be the “Keplerian” frequency 

for the motion of a neutral particle in a circular geodesic orbit of radius r about 
a Reissner-Nordstrom black hole of mass M and charge Q. Here we use Boyer- 
Lindquist type of coordinates for the Kerr-Newman geometry. Timelike circular 
geodesic orbits exist down to a null orbit of radius tm such that 2tn = 3M -|- 
(9M^ — Let ljn = WK(^’Af), then it can be shown that for a slowly 

rotating black hole 





wbh " 


i ±joJN + rUjQM, 


(38) 


where 




A \rN + M 










(39) 



is an effective black hole rotation frequency. The (2j -|- l)-fold degeneracy in the 
spectrum of oscillations of the spherical black hole is removed by its rotation. 
We note that is proportional to the gravitomagnetic precession frequency at 
rjq. Moreover, 

Abh « + 0 (2 - 3 ^) ' T , 



(40) 
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where n = 0, 1, 2, • • • is the mode number and Q* = 6MQ^/[rAr(9M^ — 8Q^)]. It 
is interesting to note that if the black hole is charged, the rotation removes the 
degeneracy of the damping factor as well. Moreover, in the formulas (38) and 

(40) if (wbhiI^bh) is a ringing mode, then so is (— WBH,f^BH)- These results are 
independent of the spin of the perturbing field, since they are valid for states of 
high total angular momentum j > \mj\ ^ 1. 

3 Structure of Time and Relativistic Precession 

Let us now return to the gravitomagnetic temporal structure around a rotating 
source. The gravitomagnetic clock effect involves a coupling between the orbital 
motion of clocks and the rotation of the source. On the other hand, the gravito- 
magnetic gyroscope precession occurs even for a gyroscope at rest in the exterior 
field of a rotating source. Nevertheless, there is a general physical connection be- 
tween relativistic precession and temporal structure. This is not surprising since 
the operational definition of time ultimately involves counting a definite period 
and simple precession is uniform periodic motion. It is the purpose of this section 
to explain this relationship. We do this in several steps in the context of Kerr 
geometry with 

—ds^ = —dt^ + ^ -I- S d9^ + (r^ -|- a^) sin^ 9 d(j)^ 

+ -^{c dt — as\v? 9 d(f)Y , (41) 

where 27 = -|- cos^ 9 and A = — Rgr + . Here Rg = 2GM / is 

the gravitational radius of the source and the Kerr parameter a = J/Mc is a 
lengthscale characteristic of the rotation of the source. For M = 0 and a yf 0, 
the spacetime given by (41) is flat as expected. For a = 0 and M 0, the metric 

(41) represents the Schwarzschild geometry. Finally, for a = 0 and M = 0 we 
have the metric of an inertial frame expressed in spherical coordinates {r,9,4>). 

Let us first imagine an accelerated observer in an inertial frame. Suppose that 
this observer carries along its worldline an ideal pointlike test gyroscope so that 
there is no net torque on the gyroscope and its spin axis is therefore nonrotating. 
To simplify matters, let us first assume that the path is a circle of radius r in 
the (x, y)-plane with its center at the origin of coordinates. According to the 
standard static observers in the inertial frame, the accelerated observer moves 
with uniform frequency w*z. The transformation between the inertial frame and 
the rest frame of the rotating observer involves a simple rotation of frequency w*; 
therefore, from the viewpoint of the standard (i.e. static) observers in the rotating 
frame a natural operational way to keep the direction of the gyroscope spin 
axis nonrotating is to imagine fixing this axis at some initial time with respect 
to the axes of the rotating frame, but then continuously rotating it backward 
with frequency w*. In this way, the spin direction would remain fixed in the 
inertial frame if the rotation of the observer were virtual. In reality, however, the 
observer’s proper time r is related to thy dr = dt (1 — where v = rw*; 
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hence, the backward rotation of the spin occurs with respect to the rotating 
observer’s proper time, i.e. with frequency LOt,{dt/dT). From the standpoint of 
the standard inertial observers, the time dilation causes the spin direction to 
overcompensate and hence the spin direction is not fixed but precesses with 
the Thomas precession frequency lot = —ujif{dt/dT) + w*. This amounts to a 
precession of frequency a;*( 7 — 1) in a sense that is opposite to that of the rotation 
of the comoving observer; moreover, the generalization to arbitrary acceleration 
can be simply carried out by means of the Frenet procedure. That is, a Frenet 
frame can be set up along the path of the observer in space; then, w* = v/R{t), 
where R{t) is the radius of the curvature at each instant of time t. 

The intimate connection between time dilation and Thomas precession in 
Minkowski spacetime can be extended to a gravitational field. Therefore, let us 
imagine next that the motion described above is the geodesic motion of a free 
test particle carrying an ideal test gyroscope around a spherical mass M. Let 
wk = d(j)/dt be the Keplerian frequency as perceived by static inertial observers 
at infinity; then, = GM/r^, where r is the Schwarzschild radius of the circular 
orbit. The proper frequency in this case is w = Twk, where F = dt/dr = (1 — 
‘iGM j . The gravitational time dilation involves the static “gravitational 
redshift” effect of —goo = 1 ~ 2GM/(?r in the Schwarzschild geometry as well 
as the azimuthal motion r'^ (dcj) / dt)"^ = GM/r resulting in the factor of 3 in 
r. This situation is reminiscent of the spin-orbit coupling in the motion of the 
electron around the nucleus in the hydrogen atom; however, there are subtle 
differences between the electromagnetic and gravitational cases. In this case, the 
spin precession frequency is given by the Fokker frequency wp = w — wk, and the 
sense of precession is in the same sense as the orbital motion. This gravitational 
analog of the Thomas precession has a simple and transparent explanation in 
terms of Einstein’s principle of equivalence. According to this heuristic principle, 
an observer O in a gravitational field is locally equivalent to an observer O' 
in Minkowski spacetime with an acceleration that is equal in magnitude but 
opposite in direction to the Newtonian gravitational “acceleration” of O. The 
gravitational (Fokker) precession is thus locally equivalent to Thomas precession 
with the direction of acceleration reversed. It follows that the Fokker precession is 
in the same sense as the orbital motion. For an arbitrary accelerated observer in a 
gravitational field with velocity and acceleration = Urt'^/dr, the 

nonrotating equation of motion for the torque-free pointlike spin vector = 

0) is 

d<?M 

= u''a,S'', (42) 

so that both Fokker and Thomas precessions would be present for accelerated 
motion in Schwarzschild geometry. 

The gravitomagnetic precession of a gyroscope is in a similar way related to 
the temporal structure brought about by the rotation of the source. However, 
the situation here is more complicated than the gravitoelectric Fokker precession 
since the temporal structure is affected by the coupling of orbital motion with 
the angular momentum of the source. 
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Specifically, let us imagine stable circular geodesic orbits in the equatorial 
plane of the Kerr source (41). It can be shown that 

where Tk = 27r/o;K is the Keplerian period of the orbit and a = a ijJkIc- Here 
the upper (lower) sign refers to a prograde (retrograde) orbit. It follows that the 
orbital period is given by 

t± = T^ {l±a) , T± =Tk (l±2a-3i?g/2r)i/^ (44) 

Let us first note that t+ — t- = 4TTafc and = 4aT^. Since r+ + t_ = 

2Tk + 0(c~^), we find that r+ — r_ « ^irajc. In fact, r+ — t_ monotonically 
decreases as a function of r and approaches Ai^ajc as r — >■ oo. Thus a prograde 
clock moves more slowly than a retrograde clock according to comoving observers 
as well as the standard asymptotically inertial observers at infinity; moreover, 
T+ — T_ « 47ra/c for r ^ Rg. This remarkable gravitomagnetic clock effect has 
been discussed in some detail in recent publications [22]- [27]. This classical effect 
is in some sense the gravitomagnetic analog of the topological Aharonov-Bohm 
effect; in fact, let us note that far from a finite rotating source r+ — r_ is nearly a 
constant independent of the “distance” r and the gravitational coupling constant 
G. These aspects of the clock effect have been discussed in detail elsewhere [26] . 

Let us now imagine two clocks moving in opposite directions on a stable 
circular geodesic orbit of radius r in the equatorial plane of the Kerr source. 
Suppose that at tg = 0; they are both at (j)Q = 0; let us denote the event at 
which the clocks next meet again by (ti,^i). It follows from equation (43) that 
27Tti = (()iTk(1 -I- a) for the prograde clock and 27rti = (27 t — (/)i)Tk(1 — a) for 
the retrograde clock. Thus </>i = 7 t(1 — a) and ti = Tk(1 — o?')j2. Moreover, 
T+((()i) — T^(<()i) = aT^(a^ -I- iRgl2r), which is negligibly small for clocks in 
orbit about astronomical sources in the solar system; in fact, r+{(j)i) — ~ 

0(c“^). The next time the clocks meet is at {t2,4>2), which bears the same 
relationship to as {ti,(j)i) to (to,4>o)] therefore, (p 2 = 27t(1 — a) and 

t 2 = 7k(l — a^). In general, the nth time the clocks meet is at {tn,4>n) with 
4>n = n7r(l — a) modulo 27 t and = nTK(l — o?)!‘2- 

Consider now the behavior of the diametrical line joining these events to the 
origin of the spatial coordinates. For a = 0, i.e. in the Schwarzschild case, this 
line is fixed as the clocks repeatedly meet at two diametrically opposite points. 
However, for a yf 0 the line precesses in the opposite sense as the rotation of the 
source with the precession frequency given approximately by (cf. Fig. 1) 



7 T - (/>1 
T+((/'l) 



GJ 



0(c-4), 



(45) 



which at this order is in agreement with the precession frequency of an ideal 
torque-free gyroscope that is fixed in the equatorial plane of the Kerr source 
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Fig. 1. Gravitomagnetic precession of the diametrical lines indicating the points 
{4>o,4>i^ 02 , • • • ) at which the clocks would meet. 



Our treatment of the clock effect has been limited thus far to circular orbits 
in the equatorial plane of the source. Off this plane, even circular orbits are 
not generally closed and the discussion of the clock effect as well as its intimate 
connection with the gravitomagnetic gyroscope precession becomes more com- 
plicated. In fact, the clock effect can be extended to such orbits using the notion 
of azimuthal closure [26]. 

Finally, it should be mentioned that the general motion of an ideal pointlike 
torque- free gyroscope in the Kerr field would, in accordance with equation (42), 
involve Thomas and Fokker precessions as well as a complicated gravitomag- 
netic motion that consists of both precession and nutation. Indeed, the notion of 
relativistic nutation has been introduced in the post-Schwarzschild approxima- 
tion scheme in order to describe the nutational part of the gravitomagnetic spin 
motion [29]. The complex gravitomagnetic spin motion reduces to the simple 
(Schiff) precession in the lowest post-Newtonian order. 

4 Clock Effect in the PPN Approximation 

In view of the possibility of detecting the gravitomagnetic clock effect via space- 
borne clocks, it is interesting to develop the theory of the clock effect within the 
parametrized post-Newtonian (PPN) framework. The PPN formalism contains 
a set of parameters that characterize different metric theories of gravitation in 
the post-Newtonian approximation. The general form of the PPN metric is de- 
scribed in [30,31]; it includes theories and effects that are not of primary interest 
for our treatment of the clock effect. Therefore we will start from a simplified 
PPN metric of the form 



300 = -1 + 21 / - 2 / 31/2 , 

301 = “^(47 + 4-1- ol\)H* , 

9ij = (1 + , 



(46) 

(47) 

(48) 
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which describes a rotating body in an underlying Cartesian coordinate system 
= {ct, q) with Q = (x,y,z). In the following, we will use spherical coordi- 
nates (g,9,(j)); the isotropic radial coordinate g should not be confused with the 
Schwarzschild radial coordinate r. In general relativity, the PPN parameters ai, 
(3 and 7 are given by oi = 0 and /3 = 7 = 1. 

The PPN metric (46)-(48) is restricted to theories that exhibit conservation 
laws for total momentum and ignores the Whitehead and preferred-frame effects 
[32] . We assume that the gravitational source is a uniformly rotating and nearly 
spherical body that is symmetric about the axis of rotation (i.e. the z-axis). We 
are interested in the exterior gravitational field of the source when its center of 
mass is at the origin of spatial coordinates. The gravitoelectric potential U{g,9) 
is given in this case by [ 12 ] 

7 \ /f 00 

U{g,9)=^\l + y^jJ^yP^{cos9)] , (49) 

where ge is the equatorial radius of the source, M is in effect the asymptotically 
measured mass of the source, is the Legendre polynomial of degree n and 

Jn := ^ / Kd. 9')g''^P^{cos9') d^g' . (50) 

Here g, denotes the effective mass-energy density of the source. In a similar 
way the multipole expansion of the gravitomagnetic vector potential H* can be 
expressed as [ 12 ] 

H:{g,9) = \l + f2KjyyP:,^,{cos9)] , (51) 

C Q L t ^ Q ■' 

n—1 

where J = Jz is in effect the asymptotically measured angular momentum of 
the source and P^ix) = dPn{x)/dx. Here 

2^7+3 ^ 

and 

Ln := —4+2 / Kd, O') g'^^+^ prices 9')d^g' . (53) 

IVl Qq j 

The derivation of the clock effect involves the computation of the proper time 
T over a complete azimuthal cycle along geodesic orbits about the source. For 
simplicity, we limit our discussion to circular geodesic orbits in the equatorial 
plane, i.e. g = constant and 9 = tt/2. 

The radial geodesic equation, corresponding to a circular orbit in the equa- 
torial plane, is given by 

g dx°" dxO 



(54) 
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which can be written as 



(cdt^ ^(cdt \ ^ ^ ^ 



(55) 



It is straightforward to show that = go 4 >,g/ 9 oo,g and = 

9<P<P,g/9oo,g- Using equations (46)-(48), we find that 



where 



= - ^ (47 + 4 + H {g, 6) sin^ 9 , 

00 



7 

H{g,9)= — 
c^g L 



(56) 



(57) 



and 300 = 7(1 + 2'yU) sin^ 9. The solution of equation (55) can then be written 
as 



d(j) 



-U, 



1 + (/3 + 7)C/ + ^76'U(, 
H , 



+ -(47 + 4 + a,)^ + 0(c-3). 



(58) 



Q 

It follows from the PPN metric —c^dr'^ = c^goodt"^ + 2cgo^dtd(j) + g<p^d4>'^ that 






dt N 2 1 






(59) 



Using equation (58), we find after some algebra that 



1 



dr 


Prr 


1 

2 


d(j) 


Q 





1 + (/3 + 7 — 1 )U + -gi'yU^g — |Ug|) 



+ 74, + 4 + o,)lh + O(c-0. 



Integration of this equation immediately yields t±; hence, 



"+-"- = 2 c 



^(47 + 4 + ai)|^ + 0(c-3) 



(60) 



(61) 



gives the gravitomagnetic clock effect within the restricted PPN framework 
adopted here. The explicit dependence of the gravitomagnetic clock effect on 
the PPN parameters is through the proportionality factor of (4y + 4 + oi); in 
fact, the clock effect has this feature in common with other main gravitomagnetic 
effects [32]. 

It is interesting to note that in general relativity the gravitomagnetic clock 
effect in the post-Newtonian approximation is given by 



T+ — r_ « 47 t 



J 






1 + Xd 2 - 



(62) 
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when the source is assumed to be symmetric about the equatorial plane and all 
moments higher than the quadrupole are neglected. Using data given in [12], 
we find that for the Earth J 2 « —10“^ and K 2 « —10“^, so that (3/2) J 2 — 
{9/2)K2 « 3 X 10“^ gives the relative contribution of the oblateness of the 
Earth to the clock effect for a near-Earth equatorial orbit. 

5 Detection of the Gravitomagnetic Temporal Structure 

According to Eq. (43) and the discussion following it, the orbital motion of free 
clocks around a rotating mass gives rise to the gravitomagnetic clock effect which 
shows up in the difference between the proper orbital periods of co- and counter- 
orbiting clocks. This is given by 47ra/c for equatorial trajectories. Inserting the 
specific angular momentum of the Earth (a ^ 3 m) yields an amazingly ’’large” 
value of r_|_ — T_ ~ 10“'^s. 

Despite this seemingly large effect, the actual measurement of this time dif- 
ference encounters severe practical difficulties. Since the two clocks are assumed 
to move along opposite but identical orbits, their Kepler periods exactly cancel 
upon forming the difference r+ — r_ , thereby revealing the gravitomagnetic clock 
effect. In reality, however, clocks cannot be injected into identical trajectories 
and the resulting difference in the Kepler periods will readily exceed the time 
difference induced by the rotation of the Earth. Since for near-Earth orbits a 
radial separation of 0.1 mm of the clocks involves a time difference in the Kepler 
periods of the same order of magnitude as the gravitomagnetic clock effect, the 
position of the clocks has to be known at the submillimeter level in order to 
filter the effect which is caused by the rotation of the Earth out of the data. 
Similary, as the satellite moves just under 1 mm within 10“ ^s along its track (or 
~ 10“^ milliarcseconds in angular distance), the azimuthal position has to be 
known at the same accuracy as the radial one. On the other hand, the gravito- 
magnetic time difference accumulates with the number of revolutions and after 
hundreds or thousands of periods a knowledge of the position of the clocks at 
the centimeter level will be sufficient to overcome the difference in the Kepler 
periods. 

Another difficulty arises from the determination of all the forces that act on 
the satellites carrying the clocks. Since the period of revolution for orbits of ~ 10^ 
km altitude is of the order of ~ 10^ s, accelerations as weak as 10“^^ m/s^ will 
already cover the gravitomagnetic clock effect. Among these forces, gravitational 
perturbations due to the nonsphericity of the Earth, solid and ocean Earth tides 
as well as the interaction with the Sun, Moon and planets will cause the most 
significant deviations from an ideal orbit. Depending on the altitude of the satel- 
lites, the atmospheric drag effect can also considerably change the shape of the 
orbit. Moreover, this latter effect is quite difficult to model because it strongly 
depends on the atmospheric density which is not only correlated to the orbital 
height but also subject to temporal variations. Other non-gravitational perturba- 
tions like solar and terrestrial radiation pressure, thermal thrust, charged particle 
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drag etc. must also be taken into account despite their less distinct influence, 
since they likely induce accelerations in excess of 10“^^ m/s^. 

In practice, the effect of all these perturbations will be modeled by deter- 
mining a precise orbit based on the actual spacecraft observations. This will be 
accomplished by generating an orbital trajectory following Newton’s equations 
of motion and by including all perturbing forces acting on the satellite, using 
the most accurate models available. In the next step, this predicted orbit has to 
be best fitted to the one observed, where some force parameters may be solved 
for during the orbit adjustment process in order to obtain an improved or tai- 
lored force model for the specific mission. From the resulting precise orbit the 
effects of the individual perturbations are removed step by step thus yielding a 
quasi-Keplerian orbit, but still carrying the signatures of the relativistic effects. 
Finally, a comparison with the corresponding clock predictions for an appropri- 
ate synthetic orbit is performed which is expected to confirm the clock effect 
being investigated. 

Therefore, in order to meet the very stringent conditions for the observation 
of the clock effect, many tiny perturbing sources have to be considered and 
investigated that are usually absent in most of the present orbit determination 
systems. 

6 Quantum Origin of Inertia 

The sign of the gravitomagnetic clock effect has a remarkable consequence that 
will be elucidated in this section. It follows from t+ > t- and r+ > r_ that the 
uniform motion of the prograde clock is slower than that of the retrograde clock. 
Thus in comparison with motion around a nonrotating mass, a rotating mass 
would “drag” free test particles along such that it would take longer (shorter) 
to go once around it on an equatorial circular orbit in the prograde (retrograde) 
direction. We may call this circumstance virtual “inertial antidragging,” since 
it is the exact opposite of what would be expected on the basis of the so-called 
“inertial dragging” [33]. In fact, as Fig. 2 clearly demonstrates, for a given r (i.e. 
fixed orbital radius), the faster the Kerr source spins, the slower the prograde 
motion and the faster the retrograde motion. 

Rotational or translational inertial dragging refers to the circumstance that 
an accelerating mass would somehow induce acceleration in the same sense in 
nearby masses as a consequence of the so-called “Mach’s principle” . The clock 
effect indicates that precisely the opposite situation is predicted for rotational 
motion in the equatorial plane by the general theory of relativity. Translational 
inertial dragging has been discussed by a number of authors [34]; again, such 
notions are foreign to the standard geometric interpretation of general relativity 
[35]. In general relativity, accelerated motion is absolute in the sense that it is 
nonrelative. Thus the term “absolute” as employed here only signifies the oppo- 
site of the term “relative” and is devoid of any metaphysical connotations. The 
gravitomagnetic clock effect and the gyroscope precession indicate the absolute 
rotation of the source. That is, a direct gravitomagnetic verification of Einstein’s 
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Fig. 2. Plot of the clock rate versus the rotation of the source. Imagine an ensemble 
of Kerr fields with the same mass but different angular momenta. For a fixed stable 
circular geodesic orbit with “radius” r, r+(r_) monotonically increases (decreases) over 
the ensemble with increasing angular momentum of the source. Stable orbits of this 
type occur for r > r±, where r± — 3Rg ± 4a (2Rg/r±y^^ = 3a^/r±. Let us note that 
r_ > r+ and the equality occurs for a = 0 and r± = 3Rg; moreover, r±ja ^ (3)^^^ as 
2a/ Rg — >■ 00 . The graph illustrates the behavior of t± for an ensemble of Kerr black 
holes with 2a < Rg; for 2a — Rg, r+ = a and r_ = 9a. In the graph, tq = t± {a = 0) 
and the radius r is chosen to be ^Rg. 



theory of gravitation — e.g. via NASA’s GP-B — would constitute observational 
proof that the rotation of the Earth is absolute and not merely relative to the 
distant matter of the universe [36]. 

Mach’s profound analysis of the foundations of Newtonian mechanics occa- 
sioned a thorough re-examination of the basic classical notions of space, time and 
motion that had been prevalent since Newton provided a rational basis for the 
Copernican revolution and Kepler’s laws of planetary motion. This re-evaluation 
culminated in Einstein’s theory of general relativity. It is therefore of great im- 
portance to recognize that general relativity — which agrees with all experimental 
data to date — does not contain the idea of relativity of arbitrary motion. That is, 
this concept — which was so crucial in the historical development of Einstein’s 
theory — is absent in the standard geometric interpretation of general relativity 
in the sense that it is neither a part of the foundations of the theory nor follows 
from it. The re-emergence of absolute motion may be taken to mean that gen- 
eral relativity is still not completely devoid of certain “metaphysical” elements. 
Should general relativity therefore be modified or abandoned in favor of a the- 
ory based on the relativity of arbitrary motion? To do so would be unwise. One 
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should recognize instead that physics has progressed far beyond the early days 
of relativity theory and the observational successes of general relativity must 
now be integrated within a quantum framework that involves the vacuum state 
of microphysics as well as the rest frame of the cosmic microwave background 
radiation. 

Mach noted that in Newtonian mechanics, the intrinsic state of a classical 
particle characterized by its mass m has no direct connection with the extrinsic 
state of the particle characterized by its position and velocity (x, v) in absolute 
space and time. Hence the same extrinsic state can be occupied by other masses 
comoving with the particle. Thus an observer can change its perspective by 
comoving with each particle in turn. In Newtonian mechanics, the particles are 
thus “placed” on the absolute space and time continuum and remain external 
to it. On the other hand, classical particles are “connected” to each other via 
interactions such as gravity and electromagnetism. Mach therefore concluded 
that only the motion of a particle relative to other particles should have ultimate 
physical significance. Mach’s basic analysis has been restated in modern form in 
[37]. 

In classical physics, motion takes place via classical particles as well as elec- 
tromagnetic waves. It appears that Mach did not extend his analysis of classical 
particle motion to electromagnetic wave propagation; in this connection, the is- 
sues that arise in the examination of the historical record are briefly mentioned 
in the Appendix. Let us therefore apply Mach’s argument to the motion of elec- 
tromagnetic waves. The intrinsic aspects of the wave are its amplitude, period, 
wavelength and polarization, which therefore characterize its intrinsic state. The 
extrinsic state of the wave is given by its wave function <f"(t,x) in absolute time 
and space, and we note that the wave’s intrinsic state is directly related to its 
extrinsic state, i.e. electric and magnetic held components, since the former can- 
not be defined independently of the latter. The conclusion is that the motion of 
classical electromagnetic waves is absolute, i.e. nonrelative. 

Classical motion can be either relative or absolute. In Einstein’s discussion of 
the so-called “Mach’s principle,” only “ponderable” masses are considered [33], 
whereas classical motion occurs via classical particles as well as electromagnetic 
waves. It is natural to think of the motion of classical particles (i.e. “ponderable” 
masses) as relative, since one can change one’s perspective by moving with each 
mass in turn. In the same sense, the motion of electromagnetic waves must be 
considered absolute due to its observer-independent status. The development 
of these simple notions taking due account of wave-particle duality leads to 
the principle of complementarity of absolute and relative motion [38]. In this 
connection, let us note that Mach’s analysis of classical particle motion may 
be restated in terms of the complete kinematic independence of the absolute 
position X of a particle from its momentum p = mv. In contrast, quantum 
kinematics can be consistently formulated only by imposing the fundamental 
quantum condition on the operators characterizing the position and momentum 
of a particle in absolute space and time, i.e. [xj,pk] = ihSjk- For instance, in the 
nonrelativistic motion of a free particle in the Heisenberg picture p = mv and 
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[xj,Vk] = ihm Thus in contrast to the situation in classical mechanics, the 
mass of a particle is related to its position and velocity in quantum mechanics due 
to the fact that the particle has wave characteristics as well. This idea naturally 
extends to the specific orbital angular momentum of the particle, U = eijkXjVk, 
so that [lj,lk] = ifim~^ejknln- In the limit of an infinitely massive particle, 
the connection disappears and the position and velocity commute; that is, one 
recovers classical mechanics when the system is so massive that the perturbation 
due to an act of observation on the system is negligible and the system therefore 
behaves classically. 

Mach’s argument involves classical quantities (c-numbers), whereas the quan- 
tum condition involves operators (q- numbers); nevertheless, the quantum con- 
dition implies that the intrinsic and extrinsic aspects of the particle are directly 
related through Planck’s constant. For instance, in the Schrodinger picture the 
extrinsic state of the particle is given by the wave function <F(t,x) and the 
Schrodinger equation for S' involves m, which characterizes the intrinsic state 
of the particle in Mach’s analysis. The relationship under discussion here is not 
merely formal but can be verified observationally. In fact, this kinematic con- 
nection is particularly well illustrated by the example of a free particle passing 
through a slit. The resulting diffraction angle is inversely proportional to the 
mass of the particle, so that the diffraction is absent in the limit of large mass 
and the particle behaves classically. To the extent that classical mechanics can be 
thought of as a limiting form of quantum mechanics, the epistemological prob- 
lem of Newtonian mechanics — so clearly brought out by Mach — disappears. 
That is, the problem of the origin of inertia is resolved through the wave nature 
of matter. 

Thus far the inertial mass of the particle has provided the quantum con- 
nection to the inertial reference frames of Newtonian mechanics. The invariance 
group of Minkowski spacetime is the Poincare group whose irreducible unitary 
representations can be described in terms of mass and spin. Thus in the rela- 
tivistic theory the inertial properties of a particle are characterized by mass and 
spin. The inertial properties of intrinsic spin have been discussed in [39]. 

Inertia has its origin in the fact that matter is intrinsically extended in space 
and time and through this nonlocality inertial reference frames can be “rec- 
ognized”; then, a physical system tends to preserve its state with respect to 
such frames. This is beautifully illustrated by experiments involving macroscopic 
quantum systems that have phase coherence, such as the recent demonstration 
of Earth’s absolute rotation via superfluid He^ [40] . The quantum aspects of the 
origin of inertia are further developed in [41] . 

7 Discussion 

In this paper we have examined some of the main theoretical aspects of gravito- 
magnetism in general relativity. The influence of the proper rotation of a source 
on the spacetime structure can be studied in various ways. Attention has been fo- 
cused here on certain features of the gravitomagnetic clock effect and its relation 
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with the gravitomagnetic gyro precession. However, other approaches exist and 
should be mentioned. The detection of the gravitomagnetic field of the Earth via 
the Lense-Thirring precession of satellite orbits has been investigated by Ciu- 
folini et al. [42]. Moreover, gravitomagnetic effects in the spacetime curvature can 
be measured in principle using gravity gradiometry [43] . In this connection, it is 
interesting to note that gravity gradiometers of high sensitivity that are based 
on atomic interferometry are being developed for space applications [44,45]. 



Appendix: Mach and the Absolute Motion of Light 

Newton’s introduction of the concepts of absolute space and time was truly 
revolutionary in his day and allowed him to formulate the basic classical laws of 
particle motion. Leibniz [46] and Berkeley [47] criticized the notions of absolute 
space and absolute time and emphasized instead the idea of relativity of all 
motion. Later, however. Maxwell [48] relied on absolute space and time for his 
fundamental extension of the Newtonian ideas of motion to electromagnetic field 
propagation. On the other hand, Mach revived the principle of relativity of 
all motion on the basis of a profound analysis of the foundations of classical 
mechanics [49] . Mach’s work played a significant role in Einstein’s development 
of the theory of relativity [50]. 

Mach’s deep physical treatment of the relativity of classical particle motion 
was motivated by his epistemological stance on the relativity of all measurement. 
According to Mach, the result of a measurement is the establishment of a relation 
and not of ’’absolute” notions, since in Mach’s view the latter refer to processes 
or objects that are not empirically verifiable in principle [51]. Mach’s analysis of 
the relativity of particle motion in his great work on classical mechanics [49] was 
not extended to electromagnetic wave motion in his later work on physical optics 
[52] . In this book, Mach discussed the wave theory of light as well as the speed of 
light; however, he apparently made no attempt to put these in the context of his 
epistemological stance on the relativity of all motion. There is no evidence that 
Mach ever wavered in his opposition to absolute motion [53]. However, a number 
of Mach’s contemporaries pointed out the absolute character of the constancy of 
the speed of light and were troubled by the fact that this aspect of the relativity 
theory was in conflict with the relativity of all motion. Among the physicists 
and philosophers who raised such doubts about the epistemological stance of 
the theory of relativity one can mention Friedrich Adler, Hugo Dingier, Philipp 
Frank, Anton Lampa and Joseph Petzoldt. Although it is claimed in the book 
of Blackmore [53] that Mach rejected the principle of the constancy of the speed 
of light because it was in contradiction to his phenomenalistic epistemology due 
to its constant validity independent of all sensations and conscious data, there 
is actually no evidence that Mach ever directly or indirectly commented on the 
constancy of the speed of light [54]. A historical analysis of the situation and 
the influence of these criticisms on Mach can be found in the monographs of 
Blackmore [53] and Wolters [54]. 
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An exposition of the reasons for the supposed opposition to the theory of 
relativity based on epistemological considerations and experimental facts was 
promised to appear in a sequel to Mach’s book on optics [52] in collaboration 
with his son Ludwig, but this was never published. Although the preface to the 
’’Optics” is generally regarded as the most obvious evidence of Mach’s reluctance 
to accept the relativity theory, there is every reason to believe that it was written 
by Ludwig only after the death of his father and expresses Ludwig’s opinion on 
the theory of relativity, despite the fact that Ernst Mach is stated to be the 
author of this preface. More on this conjecture can be found in [54]. 

Finally, the position of Mach vis-a-vis the theory of relativity is also discussed 
in the paper of Thiele [55] . 
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Abstract. The motion of spinning relativistic particles in external electromagnetic 
and gravitational fields is considered. The noncovariant spin formalism is crucial for the 
correct description of the influence of the spin on the particle trajectory. It is shown that 
the true coordinate of a relativistic spinning particle is the naive, common coordinate 
r. A simple derivation is presented for the gravitational interaction of first order in 
spin, for a relativistic particle. The equations of motion obtained for a relativistic 
spinning particle in an external gravitational field differ essentially from the Papapetrou 
equations. Effects of higher order in spin are discussed, including the gravimagnetic 
moment, a special spin effect in general relativity. We consider also the contributions of 
the spin interactions of first and second order to the gravitational radiation of compact 
binary stars. 



1 Introduction 

The problem of the motion of a particle with internal angular momentum (spin) 
in an external field consists of two parts: the description of the spin precession 
and accounting for the spin influence on the trajectory of motion. To lowest 
nonvanishing order in c~^ the complete solution for the case of an external 
electromagnetic field was given more than 70 years ago [1]. The gyroscope pre- 
cession in a centrally symmetric gravitational field had been considered to the 
same approximation even earlier [2]. Then, much later the spin precession was 
investigated in the case of the gravitational spin-spin interaction [3]. The fully 
relativistic problem of the spin precession in an external electromagnetic field 
was also solved more than 70 years ago [4] and then, in a more convenient for- 
malism, using the covariant vector of spin, in [5]. 

The situation with the second part of the problem, which refers to how the 
spin influences the trajectory, is different. Covariant equations of motion for a 
relativistic spinning particle in an electromagnetic field were written in the same 
paper [4] and for the case of a gravitational field in [6]. These equations have 
been discussed repeatedly from various points of view in numerous papers (see, 
e.g., [7-18]). The problem of the influence of the spin on the trajectory of a 
particle in external fields is not only of purely theoretical interest. In particular, 
it attracts attention being related to the description of the motion of relativistic 
particles in accelerators [19] (see also recent review [20]). 

In fact, it is far from being obvious whether one can observe in practice the 
discussed spin corrections to the equations of motion of elementary particles as, 
for instance, an electron or proton. According to the well-known argument by 
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Bohr (see [21]), an additional Lorentz force due to the finite size of the wave 
packet of a charged particle and to the uncertainty relation exceeds the cor- 
responding component of the Stern-Gerlach force. However, this argument by 
itself does not exclude in principle the possibility to observe a common Stern- 
Gerlach effect, even a small one, in the presence of a larger background due to 
the uncertainty relation. In particular, in a recent paper [22] this possibility is 
claimed to be supported by numerical calculations. Moreover, spin-dependent 
correlations certainly exist in differential cross sections of scattering processes. 
So, it was proposed long ago to separate charged particles of different polar- 
izations through the spin interaction with external fields in a storage ring [23]. 
Though this proposal is being discussed rather actively (see review [20]), it is 
not clear up to now whether it is feasible technically. 

There are however macroscopic objects for which internal rotation certainly 
influences their trajectories. We mean the motion of Kerr black holes in ex- 
ternal gravitational fields. This problem is of importance in particular for the 
calculation of the gravitational radiation of binary stars. In this connection it was 
considered in [24-27]. However, when turning to these calculations, we found [28] 
that the equations of motion taking account of spin to the lowest nonvanishing 
order in c“^, used in these papers, lead to results which differ from the well- 
known gravitational spin-orbit interaction even in the simpler case of an external 
field. The problem is essentially related to the correct definition of the center- 
of-mass coordinate. Moreover, it turned out that the widely used Papapetrou 
equations [6] also fail to reproduce in the same c~^ approximation the result 
for the gravitational spin-orbit interaction found in the classical work [2]. This 
discrepancy was pointed out long ago in [29] ; however the explanation suggested 
in [29] does not appear satisfactory (see [28]). 

The present talk is essentially based on recent work [28,30,31] where the 
equations of motion of a relativistic particle were derived with a noncovari- 
ant description of spin. These equations agree with well-known limiting cases. 
Though for an external electromagnetic field such equations (in the linear in spin 
approximation) have been obtained previously [19] (see also [20]), we would like 
to start with comments related to this approximation in electrodynamics. 

2 Covariant and Noncovariant Equations of Motion 
of a Spinning Particle in an Electromagnetic Field 

2.1 The Problem with the Covariant Equations of Motion 

The interaction of spin with external electromagnetic field is described, up 
to terms on the order of c~^ included, by the well-known Hamiltonian (see, 
e.g., [32]) 

H = -^s-B+ ^-^^s-[pxE]. (1) 

Zm Zm^ 

Here B and E are external magnetic and electric fields; e, m, s, and p are the 
particle charge, mass, spin, and momentum, respectively; g is its gyromagnetic 
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ratio. Let us emphasize that the structure of the second (Thomas) term in this 
expression not only has been firmly established theoretically, but has also been 
confirmed with high accuracy experimentally, at any rate in atomic physics. To 
avoid misunderstandings, let us note that, generally speaking, the last term in 
formula (1) should be rewritten in a Hermitian form (see, e.g., [33]): 

[px E]^]^{[px E]-[E xp]) = [px E] + ]^V X E . 

We will be interested, however, in the semiclassical approximation mainly, when 
in the interaction linear in spin, field derivatives are neglected. (Besides, the 
correction with 'V x E vanishes in the case of potential electric field considered 
in [32].) 

Let us try to construct a covariant equation of motion accounting for spin, 
which would reproduce in the same approximation the force 

^2m ^ s[vxE,m]^, (2) 

corresponding to the Hamiltonian (1) (here and below a comma with a subscript 
denotes a partial derivative). A covariant correction to the Lorentz force 
eF^'^Ui, should be linear in the tensor of spin and in the gradient of the 
tensor of electromagnetic field , it may depend also on the 4-velocity u^. 
Since = 1, this correction must satisfy the condition = 0. From the 
mentioned tensors one can construct only two independent structures meeting 
the last condition. The first. 



(3) 

reduces in the approximation to 

2s{B,m-[v X E,^]), (4) 

and the second, v 

(5) 

reduces to 

|[^x £;],„. (6) 

Let us note that possible structures with the contraction reduce to (3) 

and (5), due to the Maxwell equations and the antisymmetry of S^x- 

Obviously, no linear combination of (4) and (6) can reproduce the correct 
expression (2) for the spin-dependent force. In a somewhat less general way this 
was shown in [28]. 

But why is it that the correct (in the c~^ approximation) formula (2) cannot 
be obtained from a covariant expression for the force? Obviously, one can easily 
reproduce by a linear combination of (4) and (6) those terms in (2) which are pro- 
portional to g. In other words, there is no problem to present in a covariant form 
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the terms which describe, so to say, direct interaction of a magnetic moment with 
external fields. It is the terms in (2) independent of g and corresponding to the 
Thomas precession which cannot be written covariantly. Certainly, the Thomas 
precession can be described beyond the approximation, for arbitrary veloc- 
ities. But there are no reasons why this essentially noncovariant phenomenon 
should have a covariant description. This is the point. 



2.2 What Is the Correct Definition of the Coordinate 
of a Spinning Particle? 

It was noted in [28] that the covariant formalism can be reconciled with the 
correct results if the coordinate x entering the covariant equation is related to 
the usual one r in the c~^ approximation as follows: 



1 

2m' 



-s X V. 



The generalization of this substitution to the case of arbitrary velocities 

7 1 



X = r 



1(7 -f 1) 



S X V, 



1 = 



yr— 



( 7 ) 



(8) 



was pointed out in [20]. 

But why can the spin precession itself (as distinct from the spin influence 
on the trajectory) be described covariantly [4,5] without any concern for the 
coordinate definition? First of all, the covariant equations of spin precession 

flQ p 

^ ^ -{g- 2)u^,F^pU^S''] (9) 

(here 5'^ is the covariant 4-vector of spin) are written in the semiclassical approx- 
imation, i.e., the coordinate dependence of external fields is completely neglected. 
Second, equations (9) are linear and homogeneous in spin. So, even if one went 
here beyond the semiclassical approach, but stayed within the approximation 
linear in spin, the use of the usual coordinate r, which differs from x in terms 
proportional to s only, would be completely legitimate. 

Of course, the choice of the variable, r or x, is by itself a matter of convention. 
But still, which of them is the true center-of-mass coordinate of a relativistic 
spinning body? 

We note first of all that relation (7) is valid for a free particle as well. So, 
to answer the question, it is sufficient to consider a simple example of the free 
Dirac particle with the Hamiltonian 



H = a ■ p + Pm . 

Here, the operator whose expectation value equals to r, 
iP{oL ■ p)p +[S X p\e 



is not r itself, but [34] 



X = r 



iPa 



2e'^{e + m) 



: = 



i 2 . 



S = — [a X a] 

2F ' 



( 10 ) 
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To lowest nonvanishing order in c ^ expression (10) reduces to 

1 1 

X = r + s X V , s = <j , 

2m 2 ’ 



( 11 ) 



which might prompt indeed substitution (7). However, under the Foldy-Wout- 
huysen (FW) transformation, which separates positive-energy states from nega- 
tive-energy ones, the relativistic operator x goes over into mere r. And the 
transition from the exact Dirac equation in an external field to its approximate 
form containing only the first-order correction in c“^, is performed just by means 
of the FW transformation. Thus, in the resulting Hamiltonian the coordinate 
of a spinning electron is the same r as in the completely nonrelativistic case. 
Nobody makes substitution (7) in the Coulomb potential when treating the 
spin-orbit interaction in the hydrogen atom. 

As to a classical particle, it is in fact a well-localized wave packet constructed 
from positive-energy states, i.e., it is properly described in the FW representa- 
tion. Therefore, it is r which is the true coordinate of a classical relativistic 
spinning particle. The same conclusion is made in [35], starting from quite dif- 
ferent arguments. 



2.3 The Noncovariant Formalism 

The correct equations of motion in an electromagnetic field including spin to first 
order are known for a fairly long time [19]. Though being fully relativistic, they 
are noncovariant and based on the initial physical definition of spin. According 
to this definition, spin is the 3-dimensional vector s (or 3-dimensional antisym- 
metric tensor Smn) of the internal angular momentum defined in the rest frame 
of the particle. The covariant vector of spin (or the covariant antisymmetric 
tensor are obtained from s (or Smn) by the Lorentz transformation. By 

the way, an advantage of this approach is that the constraints = 0 and 

= 0 hold identically. The precession frequency for spin s at an arbitrary 
velocity is well-known (see, for instance, [32]): 

fl= — \{g-2) B —v(vB)-vxE 

2m Y [ 7-k 1 

-g2 -B —V xE \ . (12) 

.7 7+1 Ji 

Naturally, the corresponding interaction Lagrangian (here the Lagrangian for- 
mulation is somewhat more convenient than the Hamiltonian one) equals 

Lei = f2 ■ s = s • < (o — 2) B — — - — v(vB) — v x E 
2m ^ [ 7 -kl ^ ^ 





-k2 



(13) 
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The equation of motion for the position has the usual form: 

(V-^V„)Ltot = 0, (14) 

where Ltot is the total Lagrangian of the system. The equation of motion for the 
spin in general form is 

s = -{Ltot,s}, (15) 

where {... , ...} is the Poisson bracket, or 

s = -i[Ltot,s] (16) 

in the quantum problem. This applicability of a common canonical formalism is 
one more advantage of the noncovariant approach. Meanwhile, in the covariant 
approach we have to deal in particular with higher time derivatives, which is 
obvious already from relationship (7). 

3 Spin Precession in a Gravitational Field 

In this section we present a simple and general derivation of the equations of the 
spin precession in a gravitational field (restricting to first order in spin), based on 
a remarkable analogy between gravitational and electromagnetic fields. Due to 
this correspondence, the formulae of the previous section are naturally adapted 
for the case of an external gravitational field. In this way we easily reproduce 
and generalize known results for gravitational spin effects. 
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The meaning of Eqs. (18), (20) is clear: the tetrad components of both vectors 
vary in the same way, due only to the rotation of the local Lorentz vierbein. 

In exactly the same way, the 4-dimensional spin and velocity of a charged 
particle with the gyromagnetic ratio g = 2 process with the same angular velocity 
in an external electromagnetic field, by virtue of equation (9) a,t g = 2 and the 
Lorentz equation: 



dr 



e 

m 



FabS^; 



dUg 

dr 



-Fabu\ 

m 



Thus, the correspondence: 



m 



(21) 



becomes obvious. This correspondence makes it possible to obtain the precession 
frequency a; of the 3-dimensional vector of spin s in external gravitational field 
from expression (12) via the simple substitution 




1 

2 



SiklJklcU^] 



— Ei — ;> 7oicU . 

m 



This frequency is 



— ^ikl 



1 

^Iklc + 




U 

b 



(22) 



(23) 



The factor 1/u^ in this expression is related to the transition in the left-hand 
side of Eq. ( 18) to the differentiation with respect to the world time t: 



d dt d Q d 
dr dr dt dt' 

A subscript w is attached to the quantity to emphasize that is a world, 
but not a tetrad, component of 4-velocity. All other indices in expression (23) 
are tetrad ones, c = 0, 1, 2, 3; i,k,l = 1, 2, 3. The corresponding spin-dependent 
correction to the Lagrangian is 



Lgg — (aJ • s . i.‘2F) 

However, in some respect, the first-order spin interaction with a gravita- 
tional field differs essentiallly from that with an electromagnetic field. In the 
electromagnetic case, the interaction depends, generally speaking, on a free phe- 
nomenological parameter, (/-factor. Moreover, if one allows for the violation of 
P and T invariances, one more parameter arises here, the value of the electric 
dipole moment of the particle. The point is that both magnetic and electric dipole 
moments interact with the electromagnetic field strength, thus this interaction 
is gauge-invariant for any value of these moments. Only the spin-independent 
interaction with the electromagnetic vector potential is fixed by the charge con- 
servation and gauge invariance. Meanwhile, the Ricci rotation coefficients ')abc 
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entering the gravitational first-order spin interaction (24), as distinct from the 
Riemann tensor, are not covariant. This interaction is fixed in a unique way by 
the angular momentum conservation in flat space-time, taken together with the 
equivalence principle, it has no free parameters [37,38]. 

The tetrads Ca/j. are related to the metric as follows: 

^afj.^bL'V — 

In the weak-field approximation there is no difference between the tetrad and 
world indices in The ambiguity in the choice of tetrads will be fixed by 
choosing the symmetric gauge e^i, = . Then (with = 77 ^^ + 

Using expression (19) for the Ricci coefficients, we find in the weak-field approx- 
imation ^ 

To6c — '^{hhc^a hac^b) ■ (^^) 

This approach is applied below to the problems of spin-orbit and spin-spin 
interactions, as well as to the spin precession in a plane gravitational wave^. We 
restrict mostly to the weak-field approximation. However, as distinct from the 
standard approaches, all three problems can be easily solved now at arbitrary 
particle velocities. The combination of a high velocity for a spinning particle 
with a weak gravitational field obviously refers to a scattering problem. Another 
possible application is to a spinning particle bound by other forces, for instance, 
by electromagnetic ones, when we are looking for the correction to the precession 
frequency due to the gravitational interaction. 



3.2 Spin Orbit Interaction 

In the centrally symmetric field created by a mass M, the metric is 

f, _ ^3 _ ^ _ Tg ^ _ 2kM ^ 

— — 7 ^mn — ^mn — ^mn- 

r r r r 

Here the nonvanishing Ricci coefficients are 

, kM 

~yijk — ^2 OikTj) , TOiO — ^2 ■ 



(26) 

(27) 



Plugging these expressions into formula (23) yields the following result for the 
precession frequency: 

2q -I- 1 kM 

ujis = — ^ V xr . (28) 

7 -|- 1 



In the limit of low velocities, 7 — >■ 1, the answer goes over into the classical result 

of [2]. 

^ I am grateful to T. Vargas for attracting my attention to the last problem. 
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Let us note here that in the case of an external gravitational field there is no 
covariant expression for the force linear in the particle spin. In other words, the 
deviation from geodesics of the trajectory of a spinning particle is not described 
by the Riemann tensor. If this was the case, there would be a unique possible 
covariant structure, to within a factor (in [6] it equals — l/2m): Rfj,uabu'' S°‘^ . As 
mentioned already in Section 1, the covariant description contradicts in partic- 
ular the classical result [2] for the gravitational spin-orbit interaction. 



3.3 Spin Orbit Interaction in the Schwarzschild Field 

In the present subsection we treat the spin precession in the Schwarzschild field 
beyond the weak-field approximation (though neglecting the spin influence on 
the trajectory). The 3-dimensional components of the Schwarzschild metric can 
be conveniently written as 



9n 



( 



— f ^m.n, 



1 



1 — Tg/r 

Nonvanishing tetrads are chosen as follows: 



— ^mn 



1 



l-Tglr 



4°^ = + nkUm 






■ (29) 



(30) 



in this subsection the tetrad indices are singled out by brackets. Now the non- 
vanishing Ricci coefficients (here their last indices are world ones) are 



kM 






1 - \/i - ’’sA 



At last, the precession frequency in this case is 

^ Tn { 2 

uj = — L 



2mr^ 



w 



{SjkTi - SikTj) . (31) 



(32) 



-I- u° y^l-rgjr 1 -I- \J^-rgjr j 



Here m and L are the particle mass and orbital angular momentum, respectively; 

^° = ^ “ ’’sA - (« • - rg/r)~^ - . 

The rather cumbersome general expression (32) simplifies for a circular orbit. 
Here 



u^= 1 - 



3kM\ 



- 1/2 



fkM\ 
L = rar I 1 



1/2 






SO that 



iO = 



{kMf/"^ 



r-3/2 



1 - 1 - 






(33) 



The general case of spin precession in the Schwarzschild held was considered 
previously in [39]. Our expression (33) agrees with the corresponding result of [39] 
(the precession is considered there with respect to the proper time t, but not 
with respect to t). 




118 I.B. Khriplovich 



3.4 Spin Spin Interaction 

Let the spin of the central body be Sq. The components of the metric linear in 
So, which are responsible for the spin-spin interaction, are 



hot = 2k 



[so X r\ 



Here the nonvanishing Ricci coefficients are 



, , ^ [so X r]j [so X r] 

Itjo = k V, - Vj — ^ 



The frequency of the spin-spin precession is 
ujss = -k ( 2 - (sqV)V^ 



, ^ [so X r]j 
7oii — kVi 



+k y [u(so • V) - sq{v • V) + (u • So) V] (u • V) - . 
7+1 r 



(34) 



(35) 



In the low-velocity limit this formula also goes over into the corresponding clas- 
sical result [3]. 



3.5 Spin Precession in a Plane Gravitational Wave 

Let a weak gravitational wave propagate along the axis 3. It is well known (see, 
for instance, [36], §107) that here coordinate conditions can be choosen in such 
a way that the only nonvanishing components of are 

hii = — ft -22 = fl{t — z), hi 2 = ft -21 = f 2 {t — z). 

Straightforward (though rather tedious) calculation with formulae (23), (25), 
results in the following expressions for the components of the angular velocity: 

^ (/ 1 W 2 - hvi); 

hv2); (36) 

- {vf - 4) ■ 

The equations of motion in a plane gravitational wave, following from the cor- 
responding Lagrangian L — uj^s, differ essentially from those obtained in [40,41] 
within the covariant approach. 



=1 —V 3 

-y + 1 



<^w2 =1 —rV3 

7 -I- 1 



^wS — 



7 -I - 1 



\f 1 v 1 v 2 
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4 Effects of Higher Order in Spin 

4.1 Outline of the General Formalism 

The above description of the effects linear in spin was obtained by rather simple 
means. As to the interaction of second order in spin, it may manifest itself at least 
in the motion of rotating black holes (and possibly in some subtle spin effects for 
polarized nuclei of high spin in storage rings) . Anyway, going beyond the linear 
approximation in spin is of a certain theoretical interest. To study this general 
problem, we have to resort to a more sophisticated approach [30,31]. It is based 
on the following physically obvious argument. As long as we do not consider 
excitations of internal degrees of freedom of a body moving in an external field, 
this body (even if it is a macroscopic one!) can be treated as an elementary 
particle with spin. Thus, the Lagrangian of the spin interaction with an external 
field can be derived from the elastic scattering amplitude of a particle with spin 
s by external field. In this way we can describe the interaction of a relativistic 
particle to first order in the external field, but to arbitrary order in the spin. 
Explicit closed formulae were obtained in [30,31] for the interaction of second 
order in spin. According to the arguments presented in Section 1, the discussion 
of the effects nonlinear in spin may be physically meaningful first of all in the 
classical limit s 1. This limit is certainly adequate for rotating black holes. 
However, having in mind the mentioned problem of polarized nuclei, as well as 
some theoretical questions, the results were derived in [31] for arbitrary spins. 

The accurate derivation of the second-order spin Lagrangians in electrody- 
namics and gravity, based on rather sophisticated technique of high spins, can 
be found in [30,31]. Here we confine ourselves mainly to a qualitative discussion 
of some curious features of the second-order spin interactions. 



4.2 

Even 



Second- Order Spin Effects in an Electromagnetic Field 

the final formula for the discussed interaction is lengthy: 



Lf,o — 



Q_ 

2s(2s- 1) [ 



(s-V) - ^(.;-s)(r;.V) 



(s-E)- 



7 



7 



(s • v){v ■ E) + (s • [t) X B]) 



2m? 7 -|- 1 



7 -I- 1 
(s • [d X V]) 



g-l+ -] (s-B) 
'T / 



-(ff-i) 



7 

7-1-1 



(s-r>)(i;-B) - [g- 



7 -I- 1 



(s • [n X E]) 



(37) 



Here the particle quadrupole moment Q is defined as usual: Q = Qzz\s^=s- 
It is well-known that the electromagnetic interaction of the convection cur- 
rent and magnetic moment also contributes in the nonrelativistic limit to the 
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quadrupole interaction. The value of this, induced contribution to the quadrupole 
moment (already included into Q in formula (37)) is [42] 



AQ = -e{g-l) 




s, integer spin, 

s — 1 / 2 , half-integer spin. 



(38) 



We have singled out explicitly in this formula the Planck constant h to demon- 
strate that the induced quadrupole moment AQ vanishes in the classical limit 
ft— lO, s— >-oo, fts — 1 const. Therefore, the contribution proportional to AQ 
does not influence in fact equations of motion of a classical particle (though it 
plays a role in atomic spectroscopy [42]). 

On the other hand, the electromagnetic interactions of the convection current 
and spin current also induce an interaction of second order in spin which has 
a classical limit and is described by the last two lines of formula (37). This Q- 
independent part of the interaction (37) tends to zero in the nonrelativistic limit. 
Besides, it is reducible in spin; in other words, the structure SiSj in it cannot 
be rewritten as an irreducible tensor SiSj — (l/3)SijS^ . The (/-independent part 
of the interaction (37) does not have a quadrupole structure at all. 

Of great interest is the asymptotic behaviour of the interaction (37) at 
7 — >■ oo . Surprisingly, though both (/-dependent and (/-independent parts of 
the interaction (37) by themselves grow up with energy, there is a singled out 
value of the quadrupole moment for which this interaction as a whole falls down 
at 7 — >■ oo . 

The situation resembles that which takes place for the interaction linear in 
spin. It is well-known (see, e.g., [11,43,44]) that there is a special value of the 
^-factor, 5 = 2 , at which the electromagnetic interaction linear in spin, decreases 
with increasing energy. This follows immediately from formula (12) for 7 — >■ oo. 
Thus, the choice g = 2 for the bare magnetic moment is a necessary (but insuffi- 
cient!) condition of unitarity and renormalizability in quantum electrodynamics. 
It holds not only for the electron, but also for the charged vector boson in the 
renormalizable electroweak theory. Other arguments in favour of g = 2 are given 
in [45-49]. 

The same situation takes place with the second-order spin interaction in 
electrodynamics. There is a special value of the quadrupole moment Q at which 
this interaction as well decreases with increasing energy. If we also assume g = 2, 
this value is 



Q = -s{2s-1)A,. (39) 

The same preferred value of the quadrupole moment was derived also otherwise, 
basing on the supersymmetric sum rules [46,48,49]. Again, (39) is a necessary 
condition of unitarity and renormalizability. And indeed, this is the value of 
the quadrupole moment of the charged vector boson in the renormalizable elec- 
troweak theory. For it 



g = 2 , s=l, Q = —elvr? . 
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4.3 Second-Order Spin Effects in a Gravitational Field 

For a binary star effects of second-order in spin are of the same order of mag- 
nitude as the spin-spin interaction when the spins of the components of the 
system are comparable [28]. The influence of the latter on the characteristics 
of the gravitational radiation becomes noticeable for a system of two extreme 
black holes [25]. Correspondingly, second-order spin effects in the equations of 
motion become substantial if at least one component of a binary is close to an 
extreme black hole [28]. Therefore, the investigation of these effects is not of 
a purely theoretical interest only. In principle they can be observed with the 
gravitational wave detectors under construction. 

Though the second-order gravitational spin interaction can be obtained with- 
in our general approach as well, we will resort here to an instructive short-cut 
which allows to derive easily the so-called gravimagnetic interaction [11], a grav- 
itational analogue of the Q-dependent terms in formula (37). It was mentioned 
already that the analogy between first-order spin interactions in electrodynamics 
and gravity is incomplete. The electromagnetic interaction depends on the field 
strength, which is gauge-invariant. However, the gravitational one depends not 
on the Riemann tensor, which is generally covariant, but on the Ricci rotation 
coefficients, which are not. In this respect, the second-order spin interaction dis- 
cussed below, the gravimagnetic one, which depends on the Riemann tensor, is 
the gravitational analogue of the first-order spin interactions in electrodynamics. 

The starting point of the derivation is the observation that the canonical 
momentum enters a relativistic wave equation for a particle in external elec- 
tromagnetic and gravitational fields through the combination 

n^j,=Pf,~ i 

Here 27“** are the generators of the Lorentz group; ^ab^l = e^Jabc- The commu- 
tation relation 

[7T^, il.] = S’^^Rabf.u (40) 

demonstrates the remarkable correspondence 

i E'^^Rab^.. (41) 

The squared form of the Dirac equation in an external electromagnetic field 

+ m2 + e F“''F,b) ^ = 0 

prompts the correct conclusion that for an arbitrary spin s the Lagrangian 
— e/ (2m) E°‘^Fab describes the magnetic moment interaction for g = 2] the factor 
1 /(2m) in this Lagrangian, additional to the above wave equation, becomes obvi- 
ous from the comparison with the nonrelativistic limit. Clearly, for an arbitrary 
^-factor this covariant magnetic moment interaction is 

r IP inab 

t^el — — i^ab^ ■ 



( 42 ) 
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This is in fact a covariant form of ^-dependent terms in the Lagrangian (13). 
As to the ^-independent, Thomas terms in (13), it was pointed out already that 
they cannot be presented in a covariant form with the usual, physical definition 
of the coordinate r. It is natural to define, in analogy with the magnetic moment 



the gravimagnetic moment 






^ ^ab ^cd 

2m 



Now, the correspondence (41) prompts the following gravitational analogue of 
the Lagrangian (42): 



Y'ab y^cd ry 

am — 7 ^ ^ ^ ^abcd' 

8m 



This is what we call the gravimagnetic interaction. Let us note that in the 
classical limit ^ 5“^ = e^^^^^ScUd- 

The gravimagnetic ratio k, like the gyromagnetic ratio g in electrodynamics, 
may have in general any value. Still, it is natural that in gravity the value k = 1 
is as singled out as g = 2 in electrodynamics. Indeed, the analysis of the complete 
Lagrangian for the gravitational interaction of second order in spin, including of 
course K-independent terms which correspond to the Q-independent terms in 
(37), demonstrate that just for k = 1 this total interaction asymptotically tends 
to zero with increasing energy [11,30,31]. The same conclusion is made in [50-53]. 
Unfortunately, the gravitational interaction for any spin is not renormalizable 
even at k = 1 . 

In any case, a,t g = 2 and k = 1 the equations of motion have the simplest 
form. Moreover, it has been shown in [11] that just this value of the gravimagnetic 
ratio, K = 1, follows from the wave equations in the Feynman gauge both for 
the photon and graviton in an external gravitational field, as well as from the 
Rarita-Schwinger equation for s = 3/2 in a gravitational field. 

The situation for spin 1/2 is rather tricky. Obviously, no second-order spin 
interaction is possible here. Indeed, for spin 1/2 the properties of the spin ma- 
trices 27“^ = i/ 4 ( 7 “ 7 ^ — 7 ^ 7 “) are such that Rabcd degenerates into the 

scalar curvature R (times 1/2) without any spin dependence at all. So, our ar- 
guments in favour of k = 1 do not apply for spin 1/2. And indeed the squared 
Dirac equation contains 1/4 7?, but with k = 1 one obtains here 1/8 7?. Never- 
theless, we cannot see any real physical meaning in the recent proposal [54] to 
ascribe to the electron (which in fact has no gravimagnetic interaction at all) 
the gravimagnetic ratio k = 2. 

Wave equations for particles of arbitrary spins in an external gravitational 
field were previously considered in [55] . The equation for integer spins proposed 
in [55] corresponds also to the gravimagnetic ratio k = 1. However, the value 
of K prescribed in [55] for half-integer spins is different. Even in the classical 
limit s — >■ 00 it does not tend to unity. This obviously does not comply with the 
correspondence principle according to which at least in this classical limit there 
should be no difference between integer and half-integer spins. 
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5 Multipoles of Black Holes 

Let us come back from elementary particles to macroscopic bodies. For a classical 
object the values of both parameters g and k depend in general on the various 
properties of the body. However, for black holes the situation is different. It 
has been shown in [56] from an analysis of the Kerr-Newman solution that the 
gyromagnetic ratio of a charged rotating black hole is universal (and equal to 
that of the electron!): g = 2. 

We will show that for the Kerr black hole the gravimagnetic ratio is /t = 
1. This value follows in fact from the analysis of the motion of the spin of a 
black hole in an external field in [24] (though this statement was not explicitly 
formulated there). We present here an independent and, in our opinion, simpler 
derivation of this important result. 

At large distances from a Kerr hole, the hole can be considered as a point 
source of a weak gravitational field. To linear approximation in the field of a 
hole at rest, the Lagrangian density corresponding to the interaction (43) can 
be written as 

L= ^{s-V)^hoo6{r). (44) 

4 to 

This interaction unduces a correction to the energy-momentum tensor in a single 
component: 

<5Too = 

In the gauge 

0, 

the static Einstein equation for 
component of the metric is 

Ahoo = SnkToo ■ 

The correction itself is 

hoo =«-(«• V)2 - . (47) 

m r 

Let us compare /iqo with the corresponding contribution to the Kerr metric. 
In the Boyer-Lindquist coordinates this metric is 

ds^ = (1 - !^) dt^ - ^dr^- S d0^ 

^ s' A 

— (r^ + a? + sin^ 9) sin^ 9 + 9 d(j)dt, (48) 

where A = r'^ — VgV + , S = + a? cos^ 9 , a = s/m. At = 0 the 

metric (48) describes a flat space in spheroidal coordinates [36]. Meanwhile, it is 



-2^(^-V)2<5(r). (45) 

hfii, = hfin 1 ( 46 ) 

the corresponding correction /iqo to the 00- 
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Cartesian coordinates which correspond in the flat space to the gauge (46) . The 
transition from the spheroidal coordinates to Cartesian ones is carried out with 
the required accuracy by the substitution 



r — i r + 



a{a ■ r) — ra? 
2)^2 



In the Cartesian coordinates the spin-dependent part of the 00-component of 
the metric 

500= 1- ^+’^(3cos 20- 1) 
r Zr'^ 

obviously coincides with ft-oo from formula (47) at k = 1. Somewhat more tedious 
consideration of the space components of the Kerr metric leads to the same 
result, K= 1. 

Let us note that the motion of the Kerr black hole in an external gravita- 
tional held is not described by the Papapetrou equation even if one leaves aside 
the problem of the spin-orbit interaction linear in spin. The point is that this 
equation refers to the case k = 0 [14]. 

It is proven in the same way that for a charged Kerr hole the gravimagnetic 
ratio K is also unity. Moreover, the electric quadrupole moment of a charged 
Kerr hole also equals 

0 = -2^, (49) 



the value, at which the interaction quadratic in spin decreases with energy (this 
is the obvious limit of the general formula (39) at s — >■ 00 ). It can be demon- 
strated [57] that other, higher multipoles of a charged Kerr hole, both electro- 
magnetic and gravitational, as well possess just those values which guarantee 
that the interaction of any order in spin (but of course, linear in an external 
held) asymptotically decreases with increasing energy. 



6 Gravitational Interaction of Spinning Bodies, 
and Radiation of Compact Binary Stars 

It is expected that in a few years the gravitational radiation from coalescing 
binary stars will be observed by laser interferometer systems. Its successful de- 
tection depends crucially on the accurate theoretical prediction of the exact 
form of the signal. In this way the observed effect becomes sensitive to the rel- 
ativistic corrections of the orders c“^, c~^ and c~^ to the motion of a binary 
system and to the radiation intensity. In particular, the spin-orbit interaction 
becomes essential. Moreover, effects of second order in spin may be observed in 
the gravitational radiation, in the case of two extreme Kerr black holes [25]. 



6.1 Spin Interactions in a Two Body Problem 

The spin interactions in a two-body problem can be easily obtained from the 
well-known results for the limiting case when one of the bodies (say, 2) is very 
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heavy. In this limit we have the usual spin-orbit interaction with the frequency 
u>is given in fact by formula (28) (the limit 7 — >■ 1 is sufficient here): 



VL = = 



3 k 
2 



m 2 

mi 



Isi. 



(50) 



Then, there is the well-known Lense-Thirring interaction of the orbital angular 
momentum I with the spin S 2 of the central body [58] 






2ls 




(51) 



Simple symmetry arguments dictate now the form of the total spin-orbit inter- 
action for the two-body problem: 




3 

2 



m 2 

— -l- 

rrii 



mi 

— «2 
m 2 



+ 2 («i -|- S 2 ) 



(52) 



As to the spin-spin interaction, it is of the usual form, with given by formula 
(35) (again the lowest nonvanishing order in 1/c is implied): 



Vss = ^ [3(sin)(s2n) - S 1 S 2 ]. 



(53) 



Of course, both expressions (52) and (53) can be derived directly, following, for 
instance, the approach of [36] (§106, Problem 4). 

Let us go over now to the gravimagnetic interaction. This interaction (43) 
for a particle 1 with the field created by a heavy mass m 2 reduces in lowest, first 
order in to the quadrupole form: 



1 _ n n 

~ 2^3 

where the effective gravitational quadrupole moment of the particle 1 is 



(54) 



Kl 



1 



Qlmn — {simSln „ ^mn^l) ■ 

mi 3 



For the two-body problem under discussion, expression (54) generalizes to the 
following self-interaction of spin: 



3fc 



2 r^ 



m 2 



mi 



Vs — Ki SimSln + K2 S2mS2 



mi 



m 2 



2 ^mn) j 



(55) 



resembling the usual spin-spin interaction (53) . 

At ki ^2 ~ 1 the effective quadrupole interaction (55) is of the same order of 
magnitude as the spin-spin one (53). Even in the most favourable case when they 
can become important, that of two extreme Kerr black holes, both interactions 
are of the order. The star rotation velocity is here ~ c, but the star radius is 
close to the gravitational one rg ~ c“^, so that each spin s ^ c~^ [25]. The same 
argument demonstrates that the spin-orbit interaction is of the c~^ order [25]. 

As to the common quadrupole interaction, due to quadrupole deformations of 
the stars, it is suppressed by the small value of these deformations and, according 
to [59], can also manifest itself only in the case of two extreme Kerr black holes. 




126 I.B. Khriplovich 



6.2 Contribution of Spin Interactions to Gravitational Radiation 



The spin interactions contribute in various ways to the gravitational radiation: 
through spin-dependent corrections to the orbit radius r and to the equations 
of motion used to evaluate the time derivatives, which enter the usual expres- 
sion for the gravitational quadrupole radiation; through the corrections to the 
00 component of the energy-momentum tensor of the particles; through the 
gravitational analogue of the magnetic quadrupole radiation in electrodynamics; 
through retardation effects. 

In all our discussions of gravitational radiation we restrict ourselves to the 
case of circular orbits which is the most interesting one from the physical point 
of view [25]. Besides, the assumption of circular orbits simplifies essentially the 
calculations. Still the calculations remain tedious, so only the final results are 
presented here. 

The relative correction to the radiation intensity generated by the spin-orbit 
interaction (52) is [28] 

_ 1{73s + 45Q 
Iq I2mim2r‘^ 

Here 

32A:"^mfm|(mi -I- m 2 ) 

is the unperturbed quadrupole intensity and 

, m 2 nil 

S=Si + S2] C= — •si H «2- 

nil m 2 

It can be easily checked that the corresponding result of [25,26] would be recon- 
ciled with this one under the proper definition of the center-of-mass coordinate. 
The correction due to the spin-spin interaction (53) is [25,26] 

^ y ( 649 suS2t - 223 S 1 S 2 ). (57) 

Iq 48mim2r^ 

The expressions for Igs, as well as that for Ig below, have been averaged over 
the period of rotation. That is why both of them contain the spin components 
St orthogonal to the orbit plane. 

And at last, the spin-self-interaction correction, generated by the gravimag- 
netic interaction (55) and by the above-mentioned gravitational analogue of the 
magnetic quadrupole radiation in electrodynamics, is [28] 



L 

I, 



1 

4mim2r^ 




rr^ 2 
24 ] mi 





mi 

m 2 



s 



2 

2t 




m2 2 
24 y mi ^ 




1 ) 

24 ) m 2 ^ J 



(58) 



This correction is discussed also in [60]. 
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Abstract. The last few years have brought a great break-through in the quest for 
earth-bound detection of gravitational waves: at five sites, laser-interferometric detec- 
tors, of armlengths from 0.3 to 4km, are being built. These projects have in common 
that one prominent noise source, the shot noise, is reduced by the use of power re- 
cycling. By using advanced optical technologies early on, the German-British project 
GEO 600, although only intermediate in size (600 m), has good chances for a compet- 
itive sensitivity, at least with the first versions of the larger detectors. Particularly the 
use of the so-called signal recycling technique will allow to search for faint sources of 
only slowly varying frequency (pulsars, close binaries). The talk will describe the par- 
ticular topology of the GEO 600 interferometer, characterized by the use of a four-pass 
delay line and signal recycling. The major noise sources, and the experimental effort 
aiming at their reduction, will be discussed. The current status of the construction of 
GEO 600 will be outlined (civil engineering, vacuum, optics). The research and de- 
velopment activities at the experimental sites (Garching, Glasgow, Hannover) will be 
given broad emphasis. First science runs of GEO 600, well in time with those of other 
ground-based interferometers, are expected in the year 2001. 



1 Introduction 

The construction of laser interferometers for the detection of gravitational waves 
is being pursued by various groups all over the world and, due to their particular 
preferences, their past history, and their funding, differing approaches have been 
developed. This diversity can, at the present stage, be seen as a great asset, as it 
will allow the community to develop and investigate a wide variety of promising 
schemes and evaluate their respective merits. 

It is particularly in this field of research and development that a fruitful 
collaboration within the whole gravitational-wave community can and should 
be intensified. 

This paper will give an account of the current status of the German-British 
project GEO 600 and elaborate on the various fields of research and development 
at the main centers of GEO activity: Hannover, Glasgow, and Garching. In many 
of these fields, the GEO groups have taken a leading role. 

2 The GEO 600 Concept 

At five sites, large-scale gravitational wave detectors using laser interferometry 
are being built and are nearing completion: at two sites in the US, interferome- 
ters of 4 km are being built (LIGO), the French-Italian collaboration VIRGO is 
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building a 3 km detector near Pisa, and on a somewhat smaller scale, with 600 m 
and 300 m armlength: the German-British antenna GEO 600 near Hannover, 
and the Japanese antenna TAMA near Tokyo. 

In the case of GEO 600, the smaller armlength was not a matter of choice, but 
one of necessity. The site (on grounds belonging to the University of Hannover) 
cannot accommodate a larger antenna, and the funds did not allow buying or 
leasing ground elsewhere. This paper will try to outline how this shortcoming in 
length is to be compensated, at least partially, by the application of advanced 
interferometric techniques. 



In its arms, GEO 600 
will employ an optical de- 
lay line with only four 
light transits (DL4), as 
shown in Fig.l. After be- 
ing bounced off the dis- 
tant mirror the beam is 
reflected at the near mir- 
ror and retraces its path 
back to the beam-splitter 
BS. This DL4 scheme dif- 
fers from the conventional 
Herriot delay line, which 
would pose problems in 
size, in scattered light 
effects, and in separat- 
ing incoming and outgoing 
beam. 

Fig. 1. DL 4 configuration of the GEO 600 interferom- , „ . . 

.rp , . HP jHPf jJhe DL4 conhguration is 

eter. iwo extra mirrors Mpr and Msr tor power and ° 

signal recycling. distinct in so far as it 

has the highest number of 
beams (4) that one can 

support in a delay line made only of ‘small’ mirrors (the GEO 600 mirrors will 
nevertheless have a diameter of 18 cm). 

An outstanding characteristic of the GEO 600 concept is the use of ‘dual 
recycling’ [1,2]: In addition to the scheme of power recycling, which is now stan- 
dard in all the large detectors, a further mirror (Msr) is introduced in the output 
port, to allow a resonant enhancement of the sidebands that the gravitational 
wave produces from the carrier beam. 




3 Noise Contributions 

To obtain the projected sensitivity, extensive theoretical and experimental work 
is required to reduce the contributions of noise that would limit the sensitivity 
of GEO 600. These efforts (some are very similar to work going on at other 
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institutions) are being carried out at Hannover, Glasgow, and Garching, with 
assistance also from the institutions at Gardiff and Potsdam. 

Some of the more prominent noise sources (but by no means an exhaustive 
list) will be discussed below. Although these noise contributions are very serious, 
current technologies can reduce them to a level at which an interesting sensitivity 
of GEO 600 can be attained. 

3.1 Seismic Noise 

The influence from external mechanical vibrations, i.e. from seismic noise, is 
strongly reduced by a multiple pendulum suspension, indicated in Fig. 2. With 
triple pendulum suspensions for the most critical optical components (mirrors, 
beam splitter), and double pendulums for the less critical components, GEO 600 
is making a very ambitious effort, topped only by the even more extreme design 
of the VIRGO project with its seven pendulum stages. Installation at GEO has 
begun. 

GEO’s multiple pendulums are hung from a set of cantilever springs for 
better vertical isolation. These springs are supported by a top frame that itself 
is isolated via an encapsulated stack of ‘rubber’ and metal layers. 

Underneath this stack, an active seismic control (feedback plus perhaps feed- 
forward) will provide further isolation down to very low frequencies. 

The forces to the test mass pendulum masses are applied from reaction masses 
that are similarly suspended, to avoid direct coupling from ground noise. The 
forces to the intermediate masses are applied via coil~and~magnet actuators. For 
the application of only very minute forces to the actual test masses (the lowest 
stage), an electrostatic scheme is being considered. 

Work in this held was led by the Glasgow group, with contributions also 
from JILA (active isolation) and in close collaboration with the other GEO 
laboratories. The GEO pendulum design is a candidate for the suspension of the 
masses in the Advanced LIGO. 

3.2 Thermal Noise 

The thermal (‘Brownian’) noise of the suspended optical components can be 
split into two separate regimes: the internal vibrations of the solid suspended 
bodies (mirrors, beamsplitters), and the pendulation modes due to the pendulum 
suspension. Both must be kept as low as possible, to the very limit of current 
technology (or beyond). 

The internal mechanical vibrations of the mirrors (the ‘test masses’) can 
be kept low by choosing materials of very high mechanical Q, such as pure 
fused silica, sapphire, or even silicon. (Note that the mirrors used in the DL4 
scheme do not need to be transparent.) Elaborate methods of attaching the 
suspending ‘wires’ are required so as not to compromise the intrinsic Q. Glasgow 
and Hannover have worked out viable solutions, and Q values of 5 x 10® and better 
have been achieved with fused silica mirrors. This internal noise is responsible 
for the rather shallow portion of the curve ‘thermal noise’ in Fig. 3, above 50 Hz. 
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Also for the pendulum mo- 
tion, extremely low-loss wires 
or fibers, and very elabo- 
rate techniques for attach- 
ing these ‘wires’ to the solid 
masses were developed, and 
very promising results were 
obtained, again by research 
carried out both at Glasgow 
and Hannover [3]. 

Low-loss silica fibers (or 
thin ribbons) were welded on 
to small silica prisms, and 
these were, in turn, attached 
to the mirrors using a bonding 
technique developed at Stan- 
ford, by ‘hydroxy-catalysis 
bonding of silicates’ [4]. This 
method turned out to be 
very successful. The measure- 
ments on these pendulums 
were done in close collabora- 
tion with the VIRGO group at 
Perugia. The pendulum ther- 
mal noise is the cause of the 
steep rise of the thermal noise 
curve as one goes to lower fre- 
quencies (in GEO 600 below, 
say, 30 Hz). 



tank centre 

I 



rotational stage 
stack stabiliser 



stack 




reaction mass 



3 mmn 



I 2 stacks have beenl omitted for clarity 

^ ^ — tr^ 



Fig. 2. Schematic of the GEO suspension. The ac- 
tnal test mass is on the right-hand side, forces are 
applied from eqnally quiet reaction masses, to avoid 
introducing ground noise. 



3.3 Laser Noise 

The development of the laser system for GEO 600 is mainly in the hands of Laser 
Zentrum Hannover (LZH). The master/slave system, with its highly stabilized 
Nd:YAG MISER as ‘master’ and a powerful slave, are now completed, 14 W are 
realiably available [5]. The master is already out at the site. Improved stabiliza- 
tion of the laser with respect to frequency and power is the goal of work going 
on at Hannover. 

The geometrical noise of the beam, i.e. fluctuations in position, orientation, 
and shape of the beam, are greatly reduced by the use of ‘mode-cleaners’ first 
introduced by the GEO groups [6,7]. GEO 600 will use two in series, and inten- 
sive work was done at Glasgow. These mode-cleaners are the first units being 
optically tested, in the late months of 1999. Mirrors and seismic isolation are 
installed, and locking of the first stage has been achieved. 
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4 Shot Noise 

A very fundamental noise source is the shot noise produced by the ‘graininess’ 
of the detected light. Its effect decreases with the square root of the available 
light power. To reach sensitivities that make the detection of events from as far 
out as the Virgo cluster possible, light powers in the order of 10 kW are needed. 

As these light powers cannot be supplied by today’s lasers, the scheme of 
‘power recycling’ will be applied in GEO 600, as in all other detector projects. 
The cavity composed of the Michelson interferometer and the power recycling 
mirror Mpu is designed to resonantly enhance the light power by three orders 
of magnitude. Recycling gains in the order of 300 have been obtained [8] in the 
Garching 30-meter prototype. 



5 The GEO 600 Sensitivity 

Fig. 3 shows the predicted sensitivity (i.e. the spectral density of the apparent 
strain noise) of GEO 600, resulting from the expected contributions of the various 
noise sources. 

Having the signal sidebands resonate in the cavity formed by the Michelson 
interferometer as the one (albeit very complex) mirror, and the signal-recycling 
mirror Msr as the other, will make the signal response a rather complicated 
function of the GW frequency, which can be adapted to a variety of requirements. 

On the left-hand side, the broadband operation of GEO 600 is shown, ex- 
hibiting a broad noise minimum around a frequency of 200 Hz. The limiting 
effects are thermal noise between 40 and 200 Hz and shot noise from 300 Hz on 
upward. 

By appropriate choice of the transmission of the ‘signal recycling mirror’, a 
narrowband operation is possible, and the antenna can (by microscopic posi- 
tioning of the mirror Mgp) be tuned to a given frequency, e.g. to a known or 
expected GW source. This is indicated in the righthand diagram, showing a dip 
in the shot -noise contribution at 600 Hz. 

In this way, the limitation due to shot-noise can be reduced, if one is willing 
to “sacrifice” measuring bandwidth. There can be very good reasons to do that: 
for a nearly continuous-wave signal, as in the early stages of a binary inspiral, the 
signal frequency changes only very slowly, and narrow-banding at the ‘response’ 
level reduces noise much more effectively than narrow-banding at the data- 
analysis level. But, of course, the thermal noise (intrinsic noise of the mirrors) 
does then limit the sensitivity. 

6 Interferometry 

A major subject of research at GEO 600 was the design, the analysis, and a 
critical assessment of various advanced interferometric techniques: dual recy- 
cling [1,2] and resonant sideband extraction [9] appear to have great potential 




136 



A. Rudiger and K. Danzmann 




Fig. 3. Noise spectral density of GEO 600, shown for two operating modes: for broad- 
band (top) and narrowband operation (bottom). 



also for future large detectors. But also the feasibility of other interferometric 
schemes (e.g. Sagnac [10]) was explored. 

Of the schemes investigated, it will be particularly the scheme of ‘dual re- 
cycling’ that will find application in GEO 600. Elaborate simulation programs 
were written (e.g. [11]), compared at a Workshop in Garching, and shared with 
colleagues from other groups. 

At that workshop, also a variety of other software tools were presented to 
assist in the optical design [12] and development of controls [13] for advanced 
interferometry. These have been (or will be) made available to everybody in the 
community. 
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7 Civil Engineering 

GEO 600 was, from the beginning, a project that had to take into account a very 
low budget (the lowest of the world- wide community). Many of the features in 
GEO’s implementation are thus dictated by budget considerations. This was, at 
times, a great burden, but it also gave an incentive for designing and developing 
very cost-efficient techniques that can become the basis also for future large- 
scale detectors. 

7.1 Construction 

The construction of the central house and the end houses had been completed 
in 1997. The size of these buildings is at the lower limit, the accommodation 
of electronics and other equipment is scant, but sufficient. Some mechanical 
equipment is available at the site, but the main workshops are at the university 
site in Hannover. 

The clean-room environment is a low-cost design which, however, turned out 
to be working very satisfactorily; a clean-room class 100 was readily achieved, 
and further improvement during critical work can be achieved by using laminar- 
flow tents above the vacuum tanks. 

The connecting trenches that house the vacuum tubes were finished even 
earlier. The trenches are open constructions, the sides consisting of steel panels 
driven into the ground. For weather protection, they are loosely covered with a 
corrugated steel roofing. 

7.2 Electronics and Data Management 

Most of the electronics is designed and built by the labs in Glasgow, Hannover, 
and Garching. Gommunication between the buildings will be exclusively via glass 
fiber, to avoid crosstalk. The data can be sent via direct radio communication 
to the University of Hannover for on-line monitoring and possible remote inter- 
action, and from there via fast data links to Potsdam and Gardiff for analysis. 

8 The Vacuum System 

8.1 Vacuum Pumps 

The good vacuum required in the whole apparatus is made possible by powerful 
turbo-molecular pumps that are situated in the central house and in the end 
houses. They will run continuously, i.e. also during measurements; the magnetic 
bearings allow a very quiet operation. 
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8.2 Vacuum Tubes 

GEO 600 has tried a very cost- 
effective way of producing the 
vacuum tubing. The tube is 
made of pieces of 60 cm diam- 
eter stainless steel tubes of less 
than 1 mm thickness. The nec- 
essary stiffness against air pres- 
sure stems from a bellows-type 
convolution with a few cm pe- 
riod (see Fig. 4). The tube is sus- 
pended from crossbars that tra- 
verse the trench. 



8.3 The Vacuum 
Achieved 




The tubes, wrapped in a 20 cm 
layer of rock wool, were first air- 
baked at 200 °C for one week, 
and then baked under vacuum 
for several more days. The vac- 
uum achieved, in the range of 
10“®mbar, is by far sufficient. 
The tubes are supposed never 
to be let up to air again: they 
are shut off with gate valves 
when opening the vacuum tanks 
becomes necessary, as in the 



present phase of installation of suspensions. 



Fig. 4. The convoluted tubes during installation; 
with each new 4.5 m piece welded on, the tube was 
slid 4.5 m further into the 600 m long trench, using 
trolleys along the aluminum rail at the top. 



8.4 Vacuum Tanks 

The vacuum tanks (9 in the central house, one each in the end houses) have 
been leak-tested, baked out, and installed. The metal-and-rubber stacks, three 
per tank, are encapsulated in steel bellows to protect the vacuum. They are 
finished, most already installed. They will carry the top suspension structures, 
from which double or triple pendulums will carry the main optical components. 



9 The 30 Meter Prototype 

Many of the advanced interferometric concepts require verification and optimiza- 
tion by laboratory experiments. Such experimental development went on at the 
Glasgow 10 m prototype, as well as at the Garching 30 m prototype. It is the 
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Fig. 5. The Garching 30 m prototype. 



latter that will be treated here in some more detail. For a full discussion, see 
[16]. 

The Garching 30 m prototype (Fig. 5) was used for verifying the concepts of 
power and signal recycling, and for experimentally investigating the demands on 
the overall control system. 

The laser (here still an Argon ion laser) is first stabilised against a reference 
cavity, and then against the power recycling cavity consisting of the interferom- 
eter and the mirror Mpr. A fully automatic alignment system for controlling 10 
mechanical degrees of freedom provides the stability of the settings of the opti- 
cal components [14]. The frontal (‘Schnupp’) modulation simplifies the read-out 
with which the Michelson interferometer is kept in lock, as compared with the 
previous concept of ‘external modulation’. 

Only after this set-up was sufficiently robust, the next step, the introduction 
of the signal recycling mirror Msr, was started. Error signals for controlling its 
position are obtained with the same Schnupp modulation, using an additional 
photodetector, PD4. One big problem in such complicated interferometers is the 
fact that many of the control loops can work properly only after all other loops 
are locked. The operation of prototypes is essential in finding feasible schemes 
for this lock acquisition problem. The appropriate arm lengths between beam 
splitter and the four mirrors were chosen with the help of an elaborate simulation 
code, developed in close collaboration between Glasgow and Garching. 

With all of this cautious preparatory work done, acquiring lock of the signal 
recycling loop was then surprisingly easy, and gave the expected enhancement 
of the signal response [15]. It furthermore gave evidence of the important effect 
of ‘mode healing’ when signal recycling is introduced: Only those transversal 
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modes that are supported by the signal recycling cavity will be dominating at 
the output photo diode, which results in a contrast enhancement. 

At the beginning [15], the output was still dominated by an excessive fre- 
quency noise contribution from the laser. Solving this problem gave a consider- 
able improvement in sensitivity, but studies for further noise reduction are going 
on. 

10 Data Management 

An integral part of GEO 600 are the groups at Cardiff and Potsdam. In working 
on the theory of gravitational wave sources, they help to choose the right settings 
for the detector parameters. The choice of the data acquisition hardware and the 
development of the software is made in close collaboration with these groups. 
Strategies for sensitive and efficient signal detection are being developed both at 
Potsdam and Cardiff. 

11 Outlook 

The aim of GEO 600 is to have first science runs of the interferometer in the 
year 2001. This is well within the time when also other large detectors will begin 
taking data. It will be an exciting time to see gravitational wave astronomy to 
come about. 

An interesting mix of theoretical, experimental, and technical problems is yet 
ahead of us, and these tasks could very well, and with benefit for all, be tackled 
in an international collaboration, by an exchange of ideas as well as of personnel. 
The GEO groups explicitly endorse any such collaboration. 
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Abstract. The paper gives an introduction to the gravitational radiation theory of 
isolated sources and to the propagation properties of light rays in radiative gravitational 
fields. It presents a theoretical study of the generation, propagation, back-reaction, and 
detection of gravitational waves from astrophysical sources. After reviewing the various 
quadrup ole-moment laws for gravitational radiation in the Newtonian approximation, 
we show how to incorporate post-Newtonian corrections into the source multipole 
moments, the radiative multipole moments at infinity, and the back-reaction potentials. 
We further treat the light propagation in the linearized gravitational held outside a 
gravitational wave emitting source. The effects of time delay, bending of light, and 
moving source frequency shift are presented in terms of the gravitational lens potential. 
Time delay results are applied in the description of the procedure of the detection of 
gravitational waves. 
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1 Introduction 

It was only in the late fifties of the twentieth century that by the work of Her- 
mann Bondi and Joseph Weber gravitational radiation entered the domain of 
physics. Before that time gravitational radiation was not considered to be of ob- 
servational relevance and the gravitational radiation theory was not developed 
very deeply. 

The supposed detection of gravitational radiation by J. Weber in the late 
sixties triggered strong and still on-going efforts both in the building of grav- 
itational wave detectors and in the elaboration of the gravitational radiation 
theory, including investigations of the most reliable sources of detectable gravi- 
tational waves, calculations of wave forms, and analysis of data from detectors 
(cf. [1]). It turned out that coalescing neutron stars and/or stellar-mass black 
holes, together with gravitationally collapsing objects (type II supernovae), are 
the most relevant sources for detectable gravitational waves on Earth because 
they are strong and fit well to the frequency band of the Earth-based detectors 
which ranges from 10 Hz to 10 kHz. The strength of these sources is such high 
that several detection events per year might be expected in future fully devel- 
oped detectors. The most sensitive Earth-based detectors will go into operation 
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in the first few years of the new millenium. These are the laser-interferometric 
detectors in Germany, GEO600, built by a German/British consortium, in Italy, 
VIRGO, built by a Italian/French consortium, the two LIGO detectors in the 
United States, and the TAMA300 detector in Japan. There are several bar detec- 
tors already operating on Earth (ALLEGRO in the United States, AURIGA and 
NAUTILUS in Italy, EXPLORER at GERN, NIOBE in Australia). These de- 
tectors are being permanently upgraded and will be supplementing the measure- 
ments of the interferometric detectors later. For the measurement of gravitational 
waves in the frequency range between 0.1 Hz and 0.1 mHz the space-borne laser- 
interferometric detector LISA is devised which is expected to be flown around 
2010 by NASA/ESA. The astrophysical sources of the gravitational waves to 
be detected by this detector are a variety of orbiting stars (interacting white 
dwarf binaries, compact binaries), orbiting massive black holes, as well as the 
formation and coalescence of supermassive black holes. Stochastic gravitational 
waves from the early universe are expected to exist in the whole measurable 
frequency range from 10^ Hz down to 10“^® Hz. The tools to possibly measure 
the primordial waves are the mentioned Earth-based and space-borne detectors, 
Doppler tracking, pulsar timing, very long baseline interferometry, as well as the 
cosmic microwave background. 

On the theoretical side there are essentially two approaches which permit to 
investigate the properties of and to make predictions about gravitational waves 
from various sources. The first approach, that we can qualify as “exact”, stays 
within the exact theory, solving or establishing theorems about the complete 
non-linear Einstein field equations. Within this approach one can distinguish 
the work dealing with exact solutions of the field equations in the form of plane 
gravitational waves, and especially colliding plane waves. Since the waves are 
never planar in nature, this work is not very relevant to real astrophysics, but 
its academic interest is important in that it permits notably the study of the 
appearance of singularities triggered by collisions of waves. Also within the exact 
approach, but more important for applications in astrophysics, is all the work 
concerned with the study of the asymptotic structure of the gravitational field of 
isolated radiating systems. The work on asymptotics started with the papers of 
Bondi et al. [2] and Penrose [3]. The second approach is much more general, in 
the sense that it is not restricted to any particular symmetry of the system, nor it 
is applicable only in the far region of the system. However, the drawback of this 
approach is that it is only approximate and essentially looks for the solutions of 
the Einstein field equations in the form of formal expansions when c —> oo (post- 
Newtonian approximation). This approximate post-Newtonian method can be 
applied to the study of all theoretical aspects of gravitational radiation: the 
equations of motion of the source including the gravitational radiation reaction 
(works of Einstein, Infeld, and Hoffmann [4], Ghandrasekhar and Esposito [5], 
Burke and Thorne [6-8], Ehlers [9], Papapetrou and Linet [10], Damour and 
Deruelle [11,12], Schafer [13], Kopejkin [14]); the structure of the radiation field 
(work of Bonnor [15], Thorne [16], Blanche! and Damour [17]), and, more re- 
cently, accurate post-Newtonian wave generation formalisms [18-22]. 
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To the lowest, Newtonian order, the wave-generation formalism is called the 
quadrupole formalism, because as a consequence of the equality of the inertial 
and gravitational mass of all bodies the dominant radiating moment of any 
system is the (mass-type) quadrupole, which simply is at this approximation 
the standard Newtonian quadrupole moment. We are very much confident in 
using the theoretical framework of the post-Newtonian approximation because, 
marvellously enough, the framework of the Newtonian, quadrupole formalism 
has been checked by astronomical observations. 

In fact, there are two observational tests of the validity of the quadrupole 
formalism. The first test concerns the famous Hulse-Taylor binary pulsar whose 
decrease of the orbital period Pf, by gravitational radiation is predicted from the 
quadrupole formula to be [23-26] 

• 1927T MpM, l+ie" + ie4 

5c5 \ Pb J (Mp-kMc)i/3 (i_e2)7/2 ’ ^ > 

where Mp and Me are the pulsar and companion masses, e is the orbit eccen- 
tricity, and G,c are the universal gravitational constant and the speed of light. 
Numerically, one finds Pb = —2.4 x 10“^^ sec/sec, in excellent agreement (0.35% 
precision) with the observations by Taylor et al. [27,28]. The second test con- 
cerns the so-called cataclysmic variables. There we have binary systems in which 
a star filling its Roche lobe (the “secondary” with mass M 2 ) transfers mass onto 
a more massive white dwarf (the “primary” with mass Mi > M 2 ). From the for- 
mula for the angular momentum in Newtonian theory J = GMiM 2 (a/GM)^/^ 
(where M = M 1 + M 2 ), we deduce the secular evolution of the orbital semi-major 
radius a (whatever may be the mechanism for the variation of J), 

d_2J 2 M 2 f M2\ 

a J M 2 \ Ml) ’ ^ ^ 



where M 2 is the rate at which the secondary transfers mass to the primary 
{M 2 < 0). Since Mi > M 2 , the mass transfer tends to increase the radius a 
of the orbit, hence to increase the radius of the secondary’s Roche lobe, and, 
thus, to stop the mass transfer. Therefore a long lived mass transfer is possible 
only if the system looses angular momentum to compensate for the increase of 
a. For cataclismic binaries with periods longer than about two hours, the loss of 
angular momentum is explained by standard astrophysical theory (interaction 
between the magnetic field and the stellar wind of the secondary) . But for short- 
period binaries, with period less than about two hours, the only way to explain 
the loss of angular momentum is to invoke gravitational radiation. Now, from 
the quadrupole formula, we have 



/ j\ _ 32G^ M 1 M 2 M 

i J / 5c® M 

\ / GW 



(3) 



Inserting this into (2) one can then predict what should be M 2 in order that 
d/a ~ 0, and the result is in good agreement with the mass transfer measured 
from the X-rays observations of cataclysmic binaries. 
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Another important aspect of the theory of gravitational radiation, with ob- 
vious implications in astronomy, is the interaction of the gravitational wave 
field with photons. In this article we present the results of a thorough investi- 
gation of light propagation in the gravitational wave field generated by some 
isolated system. Our motivation is that electromagnetic waves are still the main 
carrier of astrophysically important information from very remote domains of 
our universe. Also, the operation of interferometric gravitational wave detec- 
tors and other techniques used for making experiments in gravitational physics 
(lunar laser ranging, very long baseline interferometry, pulsar timing, Doppler 
tracking, etc.) are fully based on the degree of our understanding of how light 
propagates in variable, time-dependent gravitational fields generated by various 
celestial bodies. Although quite a lot of work has been done on this subject (see, 
for example, [29]-[36]) a real progress and much deeper insight into the nature 
of the problem has been achieved only recently [37]- [39]. The main advantage 
of the integration technique which has been developed for finding the light-ray 
trajectory perturbed by the gravitational field is its account for the important 
physical property of gravitational radiation, namely, its retardation character. 
Previous authors, apart from Damour and Esposito-Farese [40], accounted for the 
retardation of the gravitational field only in form of plane gravitational waves. 
Hence, effects produced in the near and induction zones of isolated astronom- 
ical sources emitting waves could not be treated in full detail. As a particular 
example of importance of such effects we note the problem of detection of grav- 
itational waves created by g- modes of the Sun. The space interferometer LISA 
will be able to detect those waves. However, the problem is that LISA will fly 
in the induction zone of the emission process of these gravitational waves and, 
hence, a much more complete theoretical analysis of the working of the detector 
is needed. The approximation of a plane gravitational wave for the description 
of the detection procedure is definitely not sufficient. The other example could 
be effects caused by the time-dependent gravitational field of the ensemble of 
binary stars in our galaxy. Timing of high-stable millisecond pulsars might be a 
tool for the detection of stochastic effects produced by that field [41]. 

2 Wave generation from Isolated Sources 

2.1 Einstein Field Equations 

The gravitational field is described in general relativity solely by the metric 
tensor (and its inverse It is generated by the stress-energy tensor of 

the matter fields via the second-order differential equations 

R^'' - \g^''R = ^ > ( 4 ) 

where and R = gp^R^'^ denote, respectively, the Ricci tensor and the Ricci 
scalar. We assume that the matter tensor corresponds to an isolated source, 
i.e. has a spatially compact support with maximal radius a, and that the 
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internal gravity of the source is weak in the sense that its mass M satisfies 
GM <C ac^. Within these conditions it is appropriate to write the metric in 
the form of a small deformation of the flat metric 77^*^ = diag(— 1, 1,1,1). We pose 
{g = determinant of 5^1/) and assume that each component 
of h^'' is numerically small: <C 1. We lower and raise all indices of our 

metric perturbation with the flat metric; for instance, h^i, = and 

h = rj^^hprj. Then the held equations (4) can be re-written in terms of the metric 
perturbation by separating out a second-order linear operator acting on 
and the remaining part of the equations, which is at least quadratic in and 
its first and second derivatives, we conventionally set to the right side of the 
equations together with the matter tensor. This yields 



□ + r]>^''dpHP = 



16ttG 



( 5 ) 



where 0 = 0^ denotes the flat d’Alembertian operator and where = d^hP’'] 
on the right side of the equation we have put 



= {-g) TP’' + 



IGttG 



AP’' , 



( 6 ) 



which represents the total stress-energy distribution of both the matter fields 
- first term in (6) - and the gravitational held itself ~ second term involving 
the non-linear gravitational source AP’' = 0{h?) (note that tP’' transforms as a 
Minkowskian tensor under Lorentz transformations). The divergence of the left 
side of (6) is identically zero by virtue of the Bianchi identities, therefore the 
pseudo-tensor tP’' is conserved in the ordinary sense, 



B^tP’' = 0 , 



( 7 ) 



which is equivalent to the covariant conservation of the matter tensor, V^TP’' = 
0. A gauge transformation hP’' — >■ hP’' + dP^’' + d‘'^P — gP’'d\^^ does not affect 
the left side of (5), and consequently by solving for a vector ^p the wave equation 
= —HP one can arrange that hP’' satisfies the harmonic- gauge condition 
duhP’' = 0. In this gauge the held equations (5) simply become 



□ hP’' = 



16ttG 



(8) 



We want now to formulate the condition that the source is really isolated, 
i.e. it does not receive any radiation from other sources located far away, at 
infinity. Recall that we can express any homogeneous regular solution of the 
wave equation D/ihom = 0 at a given held point in terms of the values of ft-hom at 
some source points forming a surrounding surface at retarded times. This is the 
Kirchhoff formula (see e.g. [42]), which reads in the case where the surrounding 
surface is a sphere, 



/lhom(x',t') = 11 ^ 



d . , . 9 

~^^\ph\iora) (P^hom j 



(x,t), 



(9) 
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where p = |x — x'| and t = t' — p/c. To formulate the no-incoming radiation 
condition we say that there should be no such homogeneous regular solutions 
^hom (since they correspond physically to waves propagating from sources at 
infinity). Taking the limit r — >■ -|-oo with t + r j c= const, in Kirchhoff’s formula, 
we then arrive at the physical conditions that 



lim 

t-\- ^ =const. 






(x, t) = 0 , 



(10) 



and that rd\h^'^ should be bounded in this limit. The no-incoming radiation 
condition (10) is thus imposed at (Minkowskian) past null infinity in a 
conformally rescaled space-time diagram. 



2.2 Multipole Expansion in Linearized Gravity 

For the rest of this Section (and also in Section 5) we shall restrict ourselves to 
the case of linearized gravity, defined in particular by the neglect of the non- 
linear gravitational source term for which the field equations in harmonic 
gauge = 0 read „ 

□ . ( 11 ) 

o'* 

Within the linearized approximation the matter stress-energy tensor is diver- 
genceless: = 0. Therefore the linearized approximation is inconsistent as 

regards the motion of the matter source, which in this approximation stays un- 
affected by the gravitational field. However this approximation is quite adequate 
for describing the generation of waves by a given source (for instance acted on 
by non-gravitational forces). From the no-incoming radiation condition (10), we 
find that the unique solution of (11) is the retarded one: 

AC f rl^-x' 1 

/.-(x,t) = -^y ^^T-(x',t--|x-x'|). (12) 

Since we are being interested in the wave-generation problem, we choose the 
field point outside the source, that is r = |x| > a (with the origin of the spatial 
coordinates at the center of the ball with radius a, so a > |x'|), and we decompose 
in that region into “multipole moments” . The straightforward way to do this 
is to employ the standard Taylor formula for the formal limit x' — >■ 0, 



T(x',t- |x-x'|/c) 



+00 

-E 

1=0 



(-)' 



x'l9l 



'T{yi',t-r/cy 



(13) 



Notice the short-hand notation L = iii 2 ■ ■ ■ ii for a multi-index with I indices, as 
well as x']^ = x'^ = • • • x'*', = di^di^ ' " dp where di = djdx'- . From 

this Taylor expansion we immediately arrive at the following expression for the 
multipole decomposition of the metric perturbation. 



7W(/r''")(x,t) 



4G {-Y 
ll 

1=0 



-nr it--) 



(14) 
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where the “multipole moments” depend on the retarded time u = t — r jc and 
are given by 

= J ,u) , (15) 

In (14) we employ the notation A4 to distinguish the multipole expansion A4(h) 
from h itself. Of course, we have numerically Ai{h) = h outside the source, 
however inside the source Ai{h) and h will differ from each other; indeed h is 
a smooth solution of the equations (12) while Ai{h) satisfies □Al(h) = 0 and 
becomes infinite when r — i 0 [as it is clear from (14)]. In Section 4, dealing with 
the general case of the non-linear theory, we shall prefer to use the multipole 
expansion in terms of symmetric and trace-free (STF) multipole moments. In 
the present case it is simpler to use the non-STF moments A systematic 
investigation of the STF multipole expansion in linearized gravity can be found 
in [43]. 

Applying the (linearized) conservation law = 0 we easily find cer- 

tain physical evolution equations (and conservation laws) to be satisfied by the 
multipole moments (15). Making use of the Gauss theorem to discard spatial 
divergences of compact-support terms we successively obtain (with (n) referring 
to n time-derivatives) 

1 

- nf = , (16) 

\nf = (17) 

where the round brackets around spatial indices denote the symmetrization (and 
where L — 1 = A • • • b_i; L — 2 = A • • • q_ 2 ). As a consequence of (16) we see 
that the anti-symmetric part of in the indices ij is constant. A more general 
consequence is 

Kjk K\-1= (1 - . (18) 

In the case of the lowest-order (1 = 0 and ^ = 1) multipole moments the right 
sides vanish, and therefore these equations represent the conservation laws for 
the corresponding moments. Over all we find ten conservation laws, one for the 
mass-type monopole or total mass M, three for the mass-type dipole or center 
of mass position (times M), three for the time derivative of the mass dipole 
or linear momentum Pi, and three for the current- type dipole or total angular 



momentum Si. Specifically, we define 






M = 


II 

O 

o 

IT) 


r 7^00 


(19) 


P^ = 


= 

c 


/a IT, 


(20) 


s^ = - 

c 


eijk'H^^ = 


^ijk 1 d ^ Xj , 


(21) 
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MX, 




Then, from (16)-(18), we have 
M = 0, F, = 0, 



^i = 0, 



and 




(22) 



(23) 



3 The Quadrupole Moment Formalism 



3.1 Multipole Expansion in the Far Region 

We analyze the gravitational field in the far-zone of the source, in which we 
perform the expansion of the multipolar expansion when r — >■ +oo with 

t — r /c =const (Minkowskian future null infinity J^^). To leading order 1/r the 
formula (14) yields 







(24) 



[where (/) represents the Ith time-derivative and where 

with n* = jr\. Because of the powers of 1/c in front of each multipolar piece, 
it is clear that the far-zone expansion of A4(h) is especially useful when the 
numerical value of each term of the formula (24) really scales with the factor 
1 /c* in front of it. This will be the case when the typical velocities of the particles 
composing the system are small with respect to the speed of light, u/c = £ <C 1, 
or, equivalently, when the maximal radius a of the system is much smaller than 
the wavelength A of the emitted gravitational radiation (A = cP where P is 
the typical period of the internal motion). In particular, this “slow motion” 
assumption is always realized in the case of a self-gravitating system with weak 
internal gravity, for which we have 





(25) 



Thus, for slowly-moving systems we can retain only the first few terms in 
the multipolar-post-Newtonian expansion (24). Let us restrict ourselves to the 
terms 



/r™ = - 



4G 



H 



00 



'Oi 



- 

c'^r [ 



n 



Oi 



Ua 

C 



( 1 ) 



(1) . 



n. 



4G 

c^r 



( 2 ) 

T/OO 

'2c2 



(26) 

(27) 

(28) 



For easier notation we do not indicate the multipole expansion M, nor the 
neglected 0(l/r^) in the distance to the source. Using the conservation laws 
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(16)-(17) we can easily re-express the latter expressions in terms of the total 
mass M, total linear-momentum Pi, and the Newtonian quadrupole moment 

1 r rpOO 

Qij = = / (fx-^XiXj . (29) 

Since M and Pi are conserved they do not participate to the radiation field which 
is therefore dominantly quadrupolar. Restoring the neglected post-Newtonian 
error terms we obtain 

^ Qab{u) + 0 (e^)| , (30) 

Qai (■*^) + *^ (^^) I > (31) 

4G f 1 (2) I 

Note that the contribution of the angular momentum Si appears only at the sub- 
dominant order 0(l/r^) [see for instance (78) below]. When acting on terms of 
order 1/r in the distance like in (30)-(32) the derivative di, is proportional to the 
(Minkowskian) null vector ki, = (— l,n); namely = —k^d^ + 0(l/r^). Using 
this, the harmonic gauge condition = 0 reads 

(1) / 1 \ 

O j (33) 

which is checked directly to be satisfied by the expressions (30)-(32). 

3.2 The Far-Field Quadrupole Formula 

Using the gauge freedom — >• we apply a gauge 

transformation to what is called the transverse-traceless (TT) gauge. Namely we 
pose 



e° = 
e = 



G 

2rc^ 

G 

2rc^ 



( 2 ) ( 2 ) 
Tlab Q ab Qa 



(2) (2) (2) 

^iab Q ab Q aa 471-0 Q i 



(34) 

(35) 



The new metric in TT coordinates, say /iJJ (where we lower indices with the 
flat metric), is straightforwardly seen to involve in its 00 and Oi components only 
the static contributions of the mass monopole and dipoles, namely 



7,tt _ _ 
''■00 — 



AGM 






/,tt _ _ 

''Oi — 



AGP, 



(36) 
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In the TT gauge the only radiating components of the field are the spatial ones, 
ij, and we obtain 

= --^Pijab(n)i^Iab (w) + O (e)| + O • (37) 



The latter equation is known as the far-field quadrupole equation. The TT pro- 
jection operator is defined by Pijab = PiaPjb — \PijPab, where Pij = 6ij — riirij. 
This is a projector: namely PijkiPuab = Pijab, onto the plane orthogonal to n: 
thus for instance, riiPijab = 0; and it is trace free: Pijab^ab = 0, so we have 
substituted in (37) the trace free part of the quadrupole moment Qab, he. 

lij = Qij ^ Qaa + O (e^) . (38) 

We have added a remainder 0{e^) to indicate the post-Newtonian corrections in 
the source moment II computed in Section 4 [note that the remainder in (37) 
is only 0{e)]. 

As we see all the physical properties of the gravitational wave are contained 
into the TT projection (37). As a consequence, the effects of the wave on matter 
fields are transverse: the motion of matter induced by the wave takes place only 
in the plane orthogonal to the propagation of the wave. Furthermore, from the 
trace-less character of the wave we see there can be only two independent compo- 
nents or polarization states. We introduce two polarization vectors, p and q, in 
the plane orthogonal to the direction of propagation n, forming an orthonormal 
right-handed triad. In terms of these polarization vectors the projector onto the 
transverse plane reads Pij = PiPj + qiQj- The two polarization states (customarily 
referred to as the “plus” and “cross” polarizations) are defined by 



^ PiPj - QiQj ^TT 



ij , 



f, _ PiQj PPjQi f,TT 

«-x - ^ riij 



(39) 



Until very recently all expectations was that any astrophysical (slowly- 
moving) source would emit gravitational radiation according to (at least dom- 
inantly) the quadrupole formula (37), involving the mass-type quadrupole mo- 
ment lij . For instance the waves from the binary pulsar obey this formula. How- 
ever, it has been realized by Andersson [44] and Friedman and Morsink [45] that 
in the case of the secular instability of the r-modes (rotation, or Rossby modes) 
of isolated newly-born neutron stars, the gravitational radiation is dominated by 
the variation of the current-type quadrupole moment Jij. Here we give, without 
proof, the formula analogous to (37) but for the current quadrupole: 



lTT 



8G (2) 

Pijab{^^ ^acd Jbd (^) , 



'bj I current 

where the (trace-free) current quadrupole moment is given by 

nOb 



/ rpOb 

d X X j 'jXa -f G ^ . 



(40) 



(41) 
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It could be, rather ironically, that the first direct detection of gravitational waves 
would follow the formula (40) rather than the classic formula (37) appearing in 
all text-books such as [8] . 



3.3 Energy Balance Equation and Radiation Reaction 

The stress-energy pseudo-tensor of all matter and gravitational fields (in har- 
monic coordinates) is defined by (6). Now, for gravitational waves propa- 
gating in vacuum at large distances from their sources (in regions where the 
waves are almost planar), it is appropriate to define the stress-energy tensor of 
the waves as the gravitational source term [involving in the definition of 

in which is replaced by the far-held metric (37). Since the expression of 
the metric is valid up to fractional terms 0(l/r^) in the distance, and since 
is at least quadratic in h, the stress-energy tensor of gravitational waves will be 
valid up to 0{l/r^). Thus, we dehne, in the far-zone. 



Now to quadratic order is a complicated sum of terms like hddh + dhdh. 
But when using (33) together with the fact that = 0, this sum simplihes 
drastically and we end up with [still neglecting 0(l/r^)] 



2 ( 1 ) ( 1 ) 2 
^^~ 327tG ~ 167TG 



r (1) , 
(h+r 



(1) J 

(h.r 



(43) 



The second form is obtained from the inverse of (39): = {piPj — qiqj)h+ + 

{Piqj +Pjqi)hx- The expression (43) takes the classic form a k^k'' of the stress- 
energy tensor for a swarm of massless particles (gravitons) moving with the speed 
of light. Notice from (43) that the energy density of waves is positive dehnite. 
In the general case where we do not neglect the terms 0(l/r^) the previous 
expressions of are still valid, but provided that one performs a suitable 
average over several gravitational wavelengths (see [8]). For quadrupole waves, 
substituting the quadrupole formula (37), we get 



G (3) (3) 

^Gw = I ,, I ,, . (44) 

We can integrate the conservation law = 0 over the usual three- 

dimensional space (volume element (Px), and use the Gauss theorem to obtain a 
flux of Tqw through a surface at infinity (exterior surface element dSi), so that 

^ ^ I 

We consider the /x = 0 component of this law, substitute for the expression 
(44) at the quadrupole approximation, perform the angular integration assuming 
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for simplicity a coordinate sphere at infinity (i.e. dSi = r^iiidiT), and obtain the 
famous Einstein (mass-type) quadrupole formula 



dE 

dt 



.^2 
5c^ \ 



f(3) 

/ 



(3) 

ij I ij 



+ 0 {e 



(46) 



where E = J d^x represents the energy (matter -|- gravitation) of the source, 
and where we re-installed the correct post-Newtonian remainder 0(£^). Without 
proof we give also the formula corresponding to the current-type quadrupole 
moment. 



dt ) 



I current 



16G (3) (3) 

■4^ 



(47) 



where the current quadrupole moment is defined by (41). 

Interestingly, we can treat the decrease of the energy as the result of the back- 
action of a radiative force (cf. [46]). We operate by parts the time-derivatives in 
(46) so as to obtain 




G (1) (5) 



(48) 



where we put on the left side a term in the form of a total time-derivative, 
representing a correction of order 1/c® to the energy E, given by 



SE 5 _ G [(3) (2) (4) (1) ■ 

“ 5^ ^ ^ b “ I b I ij 



(49) 



Now, after a time much longer than the characteristic period of the source (for 
definiteness one can consider a quasi-periodic source or perform a suitable aver- 
age; see e.g. [47]), the contribution due to the correcting term (49) will become 
negligible as compared to the right hand side of (48). Therefore, in the long 
term, we can ignore this term and finally, the equation (48) can be re-written 
equivalently in a form where the energy loss in the source is the result of the 
work of a radiation reaction force Fj-gao namely 

^ = -y Freac • V (50) 

where 

d?eac(^> P ^3 > P = ■ (51) 

The equation (50)-(51) is called the radiation-reaction quadrupole formula; the 
specific expression (51) of the radiation reaction force is called after Burke and 
Thorne [6-8]. This force is to be interpreted as a small Newtonian- like force 
superposed to the usual Newtonian force at the 2.5PN order (or e®). Actually, 
the Burke-Thorne radiation reaction force is valid only in a special gauge. That 
is, only in a special gauge, differing for instance from the harmonic or ADM 
gauges, does the source equation of motion involve at 2.5PN order the correcting 
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force (50)-(51). (See [48] for a discussion of various expressions of the radiation- 
reaction force in different gauges.) Notice that the reaction force (51) contains 
time derivatives up to the fifth order inclusively. In practice, for implementation 
in numerical codes, high time-derivatives have the tendency of decreasing the 
precision of a numerical computation, and therefore it is advantageous to choose 
other expressions of the reaction force for implementation in numerical codes 
[49,50] . On the other hand, the order of second, third, and higher time-derivatives 
can be reduced by making use of the Newtonian equations of motion of the matter 
source. Subsequent implication of such a form of the radiation reaction in binary 
systems leads, for example, to the theoretical prediction of the rate of orbital 
decay shown in (1). 



4 Post-Newtonian Gravitational Radiation 

4.1 The Multipole Moments in the Post-Newtonian Approximation 

In Section 2 we presented the formula for the multipole expansion of the field 
outside the source in linearized gravity. In the present section let us present, 
without proof, the corresponding formula in the full non-Zmear theory, i.e. when 
the Einstein field equations (8) are solved taking into account the gravitational 
source term . The formula will be valid whenever the post-Newtonian ex- 
pansion is valid, i.e. when (25) holds. Under this assumption the field in the 
near-zone of a slowly-moving source can be expanded in non-analytic (involving 
logarithms) series of 1/c [17]. The general structure of the expansion is 

X, c) = ^ {t, x) , (52) 

p,q 

where are the functional coefficients of the expansion (p, q = integers, in- 
cluding the zero). The general multipole expansion of the metric field Ai(h) is 
found by requiring that when re-developed in the near-zone in the limit of the 
parameter r/c — >■ 0 (which is equivalent with the formal re-expansion in the 
limit c — >■ oo) it matches with the previous post-Newtonian expansion (52) in 
the sense of the mathematical techniques of matched asymptotic expansions, i.e. 

M{h)=M(h) . (53) 

It is worthwhile noting that the equality (53) should be true in the sense of formal 
series, i.e. term by term in each coefficient after both sides of the equation are 
re-arranged with respect to the same expansion parameter. 

We find [21,51] that the multipole expansion generalizing (14) to the full 
theory is composed of two terms, 

„ +00 . y , . 

M{hn = finite p^Tta-^[M{An] ~ ^ E \ ~ r/c) \ , 

^ 1=0 ' ^ ^ 



( 54 ) 
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where is the inverse flat space-time retarded operator. Herein, the first term 
is a particular solution of the Einstein held equations outside the matter compact 
support, i.e. it satisfies D/ipart = the second term is a retarded solution 

of the source-free (homogeneous) wave equation, i.e. = t). The “multipole 

moments” parametrizing this homogeneous solution are given explicitly by an 
expression similar to (15), 

'H^'^(m) = finite part J , (55) 

but involving in place of the matter stress-energy tensor the post- Newtonian 
expansion , in the sense of (52), of the total (matter-hgravitation) pseudo- 
tensor defined by (6). Both terms in (54) involve an operation of taking 
the finite part. This finite part can be defined precisely by means of an analytic 
continuation (see [51] for details), but in fact it is basically equivalent to taking 
the finite part of a divergent integral in the sense of Hadamard [52] . Notice in 
particular that the finite part in the expression of the multipole moments (55) 
deals with the behaviour of the integral at infinity: r — >■ oo (without the finite 
part the integral would be divergent because of the factor xl ^ in the inte- 
grand and the fact that the pseudo-tensor is not of compact support). One 
can show that the multipole expansion (54)-(55) is equivalent with a different 
one proposed recently by Will and Wiseman [22]. 

Generally, it is more useful (for applications) to express the multipole ex- 
pansion not in terms of the moments (55), but in terms of symmetric trace-free 
(STF) moments. We denote the STF projection with a hat, xl = STF(x'^), so 
that, for instance, Xij = XiXj — Then it can be shown that the STF 

multipole expansion equivalent to (54)-(55) reads, 

M(hn = finite ^ E ~ r/c)\ , 

(56) 

where the parametrizing multipole moments are a bit more complicated, 

d^^^(u) = finite part J dfyi J dz z\x\/c) . (57) 

With respect to the non-tracefree expression (55) this involves an extra integra- 
tion over the variable z, with weighting function 

Hz) = - zH , J ^dz6i{z) = 1 , = S{z) . (58) 

The results (56)-(58) permit us to define a very convenient notion of the 
source multipole moments. Quite naturally, these are constructed from the ten 
components of T^iu). First of all, we reduce the number of independent com- 
ponents to only six by using the four relations given by the harmonic gauge 
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condition = 0. Next we apply standard STF techniques (see [18,43,51] for 

details), and, in this way, we are able to define six STF-irreducible multipole 
moments, denoted Xl,Yl, Zl, which are given by explicit integrals 

extending over the post-Newtonian-expanded pseudo-tensor like in (57). All 
of the moments are referred to as the moments of the source; 

however notice that, among them, only the moments (mass-type moment) 
and Jl (current- type) play a physical role at the linearized level. The other four 
moments Wl, simply parametrize a linear gauge transformation and 

can often be omitted from the consideration. Only at the order 2.5PN or do 
they start playing a physical role. The complete formulas for the moments II , Jl 
are [51] 



Il{u) 



Jl{u) 



finite part / / dz< Six^E — 



4(2/ -h 1) 






J J-1 

2(2/ -hi) 

'c4(/ + 1)(/ + 2)(2/ + 5) 

nl 



c2(/+l)(2/ + 3) 
5i+2Xijhd'iXiX{yi,u + z\x.\/c) , (59) 



Unite part J d^x J dz eab<n^SiXL-i>aEb 

-Si+iXl — l>ac dtSbc\{:x:,u + z\x\/c) . (60) 



2/ -hi 
c2(/-h2)(2/-h3)' 



In these expressions, <> refers to the STF projection, and we have posed 

=00 =0i 

2: = i (where r“ = , Ei = — , E,, = . (61) 

c 

The moments II, Jl given by these formulas are valid formally up to any post- 
Newtonian order. They constitute the generalization in the non-linear theory of 
the Newtonian moments introduced earlier in (38) and (41). 

In order to apply usefully these moments to a given problem, one must find 
the explicit expressions of the moments at a given post-Newtonian order by 
inserting into them the components of the pseudo-tensor obtained from an 
explicit post-Newtonian algorithm. Without entering into details, we find for 
instance that at the IPN order the mass-type source moment /l is given (rather 
remarquably) by a simple compact-support formula [17,21], on which we can, 
therefore, remove the finite part prescription: 



II 




XLcr + 



|x|^XL 2 4(2/ -k l)XiL „ 

2c2(2/ + 3) c2(/+l)(2/ + 3)“^* 



+ 0(£^) 



(62) 



We denote the compact-support parts of the source scalar and vector densities 
in (61) by 



2^00 _|_ j^ii 





(63) 



See Blanche! and Schafer [53] for application of the formula (62) to the compu- 
tation of the relativistic correction in the Pt of a binary pulsar [given to lowest 
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order by (1)]. On the other hand, Damour, Soffel, and Xu [54] (extending previ- 
ous work of Brumberg and Kopeikin [55], [56]) used the formula in their study 
of the Solar-system dynamics at IPN order. The property of being of compact 
support is a special feature of the IPN mass-moment II- To higher-order (2PN 
and higher) the mass moment /l is intrinsically of non-compact support (see 
its expression in [21]); hence the finite part prescription in the definition of the 
moment II plays a crucial role at 2PN. Similarly, starting already at IPN order, 
the current-moment Jl is intrinsically of non-compact support [19,21]. 

In a linear theory, the source multipole moments coincide evidently with 
the radiative multipole moments, defined as the coefficients of the multipole 
expansion of the 1/r term in the distance to the source at retarded times t — 
rjc =const. (this is evident from Section 2). However, in a non-linear theory 
like general relativity, the source multipole moments interact with each other 
in the exterior field through the non-linearities. This is clear from the presence 
of the first term in (56), containing the gravitational source yl^^, and which 
does contribute to the 1 jr part of the metric at infinity. Therefore the source 
multipole moments must be related to the radiative ones, the latter constituting 
in this approach the actual observables of the field at infinity. 

In the TT projection of the metric field one can define two sets of radiative 
moments U l (mass-type) and Vl (current-type) . The definition of these moments 
is that they parametrize the 1 /r-term of the ij components of the metric in TT 
gauge. Thus, extending the formula (37), the radiative moments are given by 
the decomposition 

,TT r> ^ f TT TT 1 /n ^ 

(64) 

The radiative moments Ul,Vl are related to the /-th time-derivatives of the 
corresponding source moments. Let us give, without proof, the result of the 
connection of the radiative moments to the source moments (59)-(60) to the order 
(or 1.5PN) inclusively. To this order some non-linear “monopole-radiative l- 
pole” interactions appear, which correspond physically to the scattering of the 
Lpole wave on the static curvature induced by the total mass of the source (i.e. 
the mass monopole M = /) - an effect well known under the name of tail of 
gravitational waves. We find [20,21] 

Ul{u) = / L (m) H ^ J dr I L (u-t) In (^—j + ki +0 , (65) 

Vl{u) = J L (u) -\ ^ J dr J L (u-r) In j + tt/ + O . (66) 

The same expressions come out from the Will and Wiseman formalism [22]. 
Here, 6 is a normalization constant (essentially irrelevant since it corresponds to 
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a choice of the origin of time in the far zone), and ki, tt; are given by 



Kl 



2f + 51 + 4 
l{l+l){l + 2) 



1-2 

E 

fc=i 



TT; 



l-l 



(67) 



The first term in (65) relates essentially to the original quadrupole formula (see 
(37)). From (65)-(66) one sees that the first non-linearity in the propagation 
of the waves is at 1.5PN order with respect to the quadrupole formula. Thus, 
with enough precision, one can replace in (65) the mass-type moment /l by its 
compact-support expression that we computed in (62) [since the post-Newtonian 
remainder in (62) is 0{£‘^)]. 



4.2 Post-Newtonian Radiation Reaction 



Emission of gravitational radiation affects the equations of motion of an isolated 
system dominantly at the 2.5PN order beyond the Newtonian acceleration. In a 
suitable gauge the radiation-reaction force density at the 2.5PN order is given by 
the quadrupole formula (51). In this Section we extend this quadrupole formula 
to include the relativistic corrections up to the relative 1.5PN order, which means 
the absolute 4PN order with respect to the Newtonian force. The method is to 
compute the radiation reaction by means of the matching [in the sense of (53)] 
of the post-Newtonian field to the exterior multipolar field. Indeed, recall that 
the post-Newtonian field is valid only in the near zone, and, thus, only via a 
matching can it incorporate information from the correct boundary condition, 
viz the no-incoming radiation condition imposed at infinity by equation (10), 
which specifies the braking character of gravitational radiation reaction. 

To the relative 1.5PN order, and in a suitable gauge, it can be shown that 
the reaction force derives from some “electromagnetic-like” scalar and vector 
reaction potentials and Explicitly we have [57] 



P'--“(x,t) = -^c 



G 

4 — 7 
c' 

P--(x,t) = ^ 



(5) 



4GM 



r+oo 



/o 



*‘"(i) 



(7) 

/ ij {t - t) 



1 (7) . , _ J_ 2 

/ ijk [t) — —X Xij I ij {t) 



■ 1 , M 4 (5) 

I jk J km 



+ 0{e^) , ( 68 ) 

+ 0(e^) . (69) 



The dominant term in the formula (68)-(69) is the standard Burke-Thorne re- 
active scalar potential at 2.5PN order [compare (68) with (51)]. In this term, 
consistently with the approximation, one must insert the IPN expression of the 
moment as given by (62). The Burke-Thorne term is of “odd” -parity-type as it 
corresponds to an odd power of 1 /c, and thus changes sign upon a time reversal 
(or more precisely when we replace the retarded potentials by advanced ones). 
Similarly is the next term in (68)-(69) is 3.5PN (i.e. £^), involving both the 
mass-quadrupole, mass-octupole and current-quadrupole moments (the term 
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in the vector potential corresponds really to in the equations of mo- 

tion). However, notice that the next term in the reaction scalar potential 
at 4PN or £® order, belongs to the “even”-parity-type. Nevertheless this term is 
really part of the radiation reaction for it is not invariant under a time rever- 
sal, as is involves an integration over the “past” history of the source, so that 
when changing the retarded potentials to advanced ones the integration range 
would change to the whole “future”; hence, this term does not stay invariant. It 
represents the contribution of tails in the radiation-reaction force, and is nicely 
consistent, in the sense of energy conservation, with the tails in the far zone 
[equation (65)]. For explicit computations of the back-reaction to 3.5PN order in 
the case of point-mass binary systems see Iyer and Will [58,59], and Jaranowski 
and Schafer [60]. 

Using the matching (53) one finds that the near-zone post-Newtonian metric 
(to 1.5PN relative order in both the “damping” and “conservative” effects) is 
parametrized in this gauge by some generalized potentials 

= □-(„ [-4^Ga^] + . (70) 

The first term represents, to this post-Newtonian order, the conservative part of 
the metric; it is of the normal “even” -parity- type and is given by the usual 
symmetric integral (half-retarded plus half-advanced) of the source densities 
= (cr, Ui) given by (63). The second term V))®^'® denotes the radiation-reaction 
potentials (68)-(69). By inserting the metric parametrized by (70) into the equa- 
tions of motion of the source (i.e. = 0 <G> = 0), and considering 

the integral of energy, we obtain the balance equation 

^ = y + o (e9) . (71) 



Here E denotes the energy of the source at the IPN (or even 1.5PN) order. 
Actually what we obtain is not E but some E + SE^/c’ + 5Ei/c^ like in (48). 
Arguing as before we neglect these 5E^ and SE'j. Substituting now the expres- 
sions (68)-(69) for the reactive potentials (and neglecting other 5E's) we get 



dE 

dt 



G 
5 c® 



f(3) 2GM 




G 



1 (4) (4) 16 (3) (3) 

I ijk I ijk +Y 5 b b 



+ 0 (£®) , 




(72) 



The right-side is exactly in agreement with the computation of the total flux 
energy emitted in gravitational waves at infinity, which is computed making use 
of the stress-energy tensor of gravitational waves (43). In particular we recover 
in the brackets of the first term of (72) the third time-derivative of the radiative 
moment Uij including its tail contribution. The difference with the standard 
derivation of the flux is that instead of computing a surface integral at infinity 
we have performed the computation completely within the source, using the local 
source equations of motion. 
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5 Light Propagation in Gravitational Fields 
of Isolated Sonrces 



5.1 General Solution of the Light Propagation Equation 



We are going now to calculate in linearized approximation the propagation of 
a light ray in the gravitational field of an isolated source at rest showing a 
mass-monopole, a spin-dipole and a time-dependent mass-quadrupole moment. 
In linearized approximation, the propagation equation for a particle (massless 
or with mass) with space coordinates x*(t) reads 



^ (f) — 90i,t 2^00, idOi.k 90k,i^ ^ 



'i 



1 



9m,kx"x“ - \9ik,j - ^9kj,i j + I ^9kj,t ~ 9ok,j I x"x^x" , 



1 






where the metric coefficients in linear approximation, are related 

with h^'' from Sections 2-4 through = —h^i, + \rjfn,h. The dots denote 
differentiation with respect to time t and c = 1 has been put for simplicity. In 
the linear approximation scheme, the velocity i* appearing on the right-hand- 
side of (73) can be treated as a constant vector. For massless particles (photons) 
it has unit length, i.e. x^x^ = 1. In the following we use i* = fc* in the right hand 
side of (73). 

The unperturbed motion of photons reads 

x\t) = xl + k\t - to) , (74) 

where Xq denotes the position of the photon at time of emission to- For solving 
the light propagation equation (73) it is very convenient to introduce the new 
time parameter r defined by t = t — t*, where t* denotes the time of the closest 
approach of the photon to the source of the gravitational field. Then it holds 

x*(r) = C + k^'T , (75) 



where is the vector pointing from the position of the source to the position of 
the photon at the closest approach. Its length is the impact parameter |^| = d. 
Obviously, and are orthogonal to each other in the euclidean sense, i.e. 

= 0. Therefore, the length of x*, r = |x|, takes the simple form r = '/r^~+~(P. 
Introducing the derivatives di = Pijd/d^^ and dr = d/ dr, where Pij = Sij — kikj 
is the projection operator onto the plane orthogonal to fc*, allows the light- 
propagation equation to be written as 

, (76) 

where the four-dimensional vector reads = (l,fc*). 

To get a complete overview of the influence of a gravitational wave, emit- 
ted from an isolated source, on the propagation of light rays, we use the rep- 
resentation of the metric coefficients (14) which is valid all-over in the space 
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outside a domain which includes the matter source. Splitting into a canon- 
ical part which contains trace-free tensors only, and a gauge part, i.e. 
ffiu = + di,w^, we obtain in case of mass-monopole, spin-dipole, 

and mass-quadrupole source moments (remember the source being at rest) 



rcan 

Joo 



2M 

r 



Ipq{t-r) 



(77) 



.rcan 

JOi 




SpTl 

2 



q 



+ 2dj 



r 



(78) 



^can 

Jij 



SijfSo"' + - r) ■ 



(79) 



Herein, for simplicity, we have put G = 1, and di = djdx^. The mass M, spin 
S'*, and the quadrupole moment lij of the source of gravitational waves are 
given (in the Newtonian approximation) by the expressions (19), (21), and (38). 
The explicit expressions for the gauge functions relating with are 
important for general discussion of light propagation in the field of gravitational 
waves emitted by the isolated source. However, for the sake of simplicity they 
will be omitted. Their precise form can be found in [37]. 

The insertion of these expressions (77)-(79) into the equation (76) results in 
the equation 



x*(r) 



2M {d^ - k,dr^ - 2S^ {e.pqdqr 



^j^jpq^iq 




+ (^dipq — kidpqr + 2kpdiqr^ ^ J’ 

-drr [w^ + cp^-k^ K+7>°)] , 



(80) 



where the vector ip^ denotes terms which are of gauge type. The precise form of 
is not important here and can be found in [37]. 

The solution of equation (80), using the boundary conditions i*(— oo) = A:* 
and s*(to) = Xq - emission point in space of the light ray at time tq, reads 

i*(r) = fc* + H*(r), (81) 

x\t) = x]q{T) + S\t) ~ S\tq) , (82) 



where x]^{t) denotes the unperturbed trajectory (75). The relativistic pertur- 
bation of the trajectory is given by 

S'*(r) = (2Md^ + 2S^kj£jpqdi^ A{t, ^) -k dipqBpq{r, - (83) 

(2Mk, + 2SPe,pqdq) 1^1 - {k,dpq ~ 2kpd,q) | | 
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S\t) = (2Md, + B{t, 0 - (2Mh + A{t, (84) 

+ dipgDpq{T,^) - (k^^pq - 2kpdiq^ Bpq{T,C) ~ 2PijdqCjg{T, ~ 

- + fc* [w°(r,C) + , 

whereby the scalar functions A and B and derivatives of the tensors B^j , Cij , Dij 
are known fully explicitly. They are given by 



Mt, 0 

B{t,0 

dkB,j{T, 0 
dkC,j{T,0 

dijkDpgir, 
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J A{t, Cjdr = — rln (^\/ (P + — \/ P + P , 



{yr) ^ Iij{t - r)^'" , 

{yr)~^ iijP - r)£p , 
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(85) 

(86) 

(87) 

(88) 



pij + ^ \ dkBpgir, 0 + P^^d,Bpg{T, £) + ed^kBpgir, ?) , 

(89) 



where the variable y = t — pp + P is the retarded time argument for the 
photon which passes through the point of closest approach to the source of the 
gravitational radiation at time t* = 0. More details concerning the method of 
calculation of light ray trajectory in time dependent gravitational fields can be 
found in [37,38]. 



5.2 Time Delay and Bending of Light 

The time delay results in the form 

t - to = |x - xoj - k • S’(r) + k • S’(ro) , (90) 



or 



t-to = |x - xoj + AmHAo) + As{t,to) + AQ{t,to) , (91) 

where jx — xgj is the usual Euclidean distance between the points of emission, 
Xq, and reception, x, of the photon. Am is the classical Shapiro delay produced 
by the (constant) spherically symmetric part of the gravitational field of the 
deflector (see, e.g. [8]), As is the Lense-Thirring or Kerr delay due to the (con- 
stant) spin of the localized source of gravitational waves [39], and Aq describes 
an additional delay caused by the time dependent quadrupole moment of the 
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source [37]. Specifically we obtain 

Am = 2MIii ^ (92) 

Jo + To_ 

As =-2e,,kk^S'^dAn , (93) 

L^O + ToJ 

Aq = d^j [B,j{T,^) - B,j{tq,^)] + Sq{t,^) - Sq{to,^) , (94) 

where 

Sq{t, ^) = k'- {w^ + ip'-) - w° - ip° . (95) 

Let us now denote by a* the dimensionless vector describing the total angle 
of deflection of the light ray measured at the point of observation and calculated 
with respect to vector /c* given at past null infinity. It is defined according to the 
relationship 

a*(r,C) = fc*[k- S'(t,^)] - S'XgC) = - Pij S’^(t,C) ■ (96) 

For observers being far away from the source of the gravitational wave the 
projection of the mass-quadrupole tensor of the source of gravitational radiation 
onto the plane orthogonal to the propagation direction of the gravitational wave 
is the crucial object which enters into the observable effects. It reads 

I^j — Bijpqipq — Fjj — (^Sij -\-TliTlj') TlpTlq Ipq i^^ip’yijTlq-\-6jpTliTiq) Ipq , (97) 

where again we denote rii = /r. 

In the case of small impact parameter d {d/rg <C 1, d/r <C 1) we respectively 
obtain for the time delay and the angle of deflection 

t — to — |x — xqI = —4tp + 2Mln(4rro) , (98) 



ai = Adi-ip , 

where ip is the gravitational lens potential having the form 

^/) = M + ejpqkPS‘>dj + ^ (t*) dpq In d . 



(99) 

(100) 



(Notice that in this gravitational lens approximation rd = fc* holds.) Remar- 
quably, the gravitational lens potential does depend on the gravitational source 
mass-quadrupole tensor only through its value at the time of closest approach. 
Furthermore, the gravitational lens potential decays like l/d^, i.e. it is not being 
influenced by the wave part of the gravitational field [40,37]. 

A direct consequence of the time delay formula is the frequency shift formula 
for a moving gravitational source 



6v 

V 





( 101 ) 
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where w* is the velocity of the observer. It is worthwhile noting that the ex- 
pression (101) holds for the source of electromagnetic waves being at past null 
infinity. An exhaustive treatment of the gravitational frequency shift for arbi- 
trary locations of observer, source of light, and the source of gravitational waves 
is rather complicated and has been done only recently in [38]. 



6 Detection of Gravitational Waves 



In the asymptotic regime of a gravitational wave field, the time delay reads 









where 0 is the angle between the receiver - (light) emitter and receiver - (grav- 
itational wave) source direction (cos0 = —N'^k'^) and where the assumption 
jx — xqI <C r has been made. Let us now apply the above formula to the time 
delay in a Michelson interferometer. Obviously, in this case, r = ro holds. For 
simplicity we assume that the interferometer device is oriented orthogonal to 
the propagation direction of the gravitational wave, i.e. N^k\ = N'^k\, where 
k\ and ^2 denote the directions of the two interferometer arms which are taken 
to be orthogonal {k\k,2 = 0) and of equal length L. We also assume that the 
light, emitted from the beam-splitter, is reflected once at the end mirrors. Fur- 
thermore, the interferometer arms are to be oriented such that they coincide 
with the main axes of the plus-polarization. Then the relative time delay of the 
reflected light beams reads 

= ^^[iy(t-ro)-/y(t-ro-2L)] . (103) 

The multiplication of the relative time delay by the angular frequency of the laser 
light, w, which is treated as constant in the approximation under consideration, 
results in the measurable phase shift at time t of A<P{t) = 2o;Z\Q(ki; t, t — 2L). 
If we assume for the plus-component of the gravitational wave the expression 
h+{t — ro) = A_|_ cos(wgf), where tOg is the constant frequency of the wave and 
A+ its constant amplitude, we obtain for the phase shift (cf. [61]) 

A<P{t) = 2A+— sin(ujgL) cos[ujg{t — L)] . (104) 

LOg 

The maximal amplitude is achieved if the condition cUgL = tt/ 2 holds. This yields 

L\max^(t) = 2A+— sin(wgt) . (105) 

LOg 

At the photo-diode the following photo-current results [62] 

Iph(t) = /min + ^ [1 - cos </>(t)] , (106) 
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where the phase is composed out of the signal A(p plus a modulation term 
from Pockels cells, <pm sin{uJmt) , i.e. 

4>{t) = A<P{t) + <j)m sin(wmt) . (107) 

The approximate decomposition of the expression (106) into a dc (“direct cur- 
rent” or non-alternate) part and a signal part reads Iph(t) = ^dc + where 

Lfr 

Idc = Imin + ^ [1 - COS A<P{t)] , (108) 

= lesJi i4>m) sin A${t) sin{uJmt) , (109) 

with Bessel functions Jq and Ji; les = /max — /min- Because of the smallness of 
the gravitational phase shift we get 

/dc = /min H ^ [1 ~ </o(<('m)] i (HO) 

/<n„ = /eff/i(«^'m)H</(t) sin(u;mi) • (HI) 

Using the equation (105), the latter equation can be written 

= /eff-ll(<()m)^+^|cOS [(Wm - Wg)t] - COS [(Wm -h Wg)t]| . (112) 

In this side-band form, the signal from the gravitational wave is being detected. 
For more details we refer to [61]. 
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Abstract. The range of expected amplitudes and spectral slopes of relic (squeezed) 
gravitational waves, predicted by theory and partially supported by observations, is 
within the reach of sensitive gravity-wave detectors. In the most favorable case, the 
detection of relic gravitational waves can be achieved by the cross-correlation of outputs 
of the initial laser interferometers in LIGO, VIRGO, GEO600. In the more realistic 
case, the sensitivity of advanced ground-based and space-based laser interferometers 
will be needed. The specific statistical signature of relic gravitational waves, associated 
with the phenomenon of squeezing, is a potential reserve for further improvement of 
the signal to noise ratio. 



1 Introduction 

It is appropriate and timely to discuss the detection of relic gravitatational 
waves at the experimental meeting like this one. We are in the situation when 
the advanced laser interferometers, currently under construction or in a design 
phase, can make the dream of detecting relic gravitons a reality. The detection 
of relic gravitational waves is the only way to learn about the evolution of the 
very early Universe, up to the limits of Planck era and Big Bang. 

The existence of relic gravitational waves is a consequence of quite general 
assumptions. Essentially, we rely only on the validity of general relativity and of 
basic principles of quantum field theory. The strong variable gravitational field of 
the early Universe amplifies the inevitable zero-point quantum oscillations of the 
gravitational waves and produces a stochastic background of relic gravitational 
waves measurable today [1]. It is important to appreciate the fundamental and 
unavoidable nature of this mechanism. Other physical processes can also generate 
stochastic backgrounds of gravitational waves. But those processes either involve 
many additional hypotheses, which may turn out to be not true, or produce a 
gravitational wave background (like the one from binary stars in the Galaxy) 
which should be treated as an unwanted noise rather than a useful and interesting 
signal. The scientific importance of detecting relic gravitational waves has been 
stressed on several occasions (see, for example, [2]- [4]). 

The central notion in the theory of relic gravitons is the phenomenon of 
superadiabatic (parametric) amplification. The roots of this phenomenon are 
known in classical physics, and we will remind ourselves of its basic features. 
As every wave-like process, gravitational waves use the concept of a harmonic 
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oscillator. The fundamental equation for a free harmonic oscillator is 

q + io\ = 0, ( 1 ) 

where q can be a displacement of a mechanical pendulum or a time-dependent 
amplitude of a mode of the physical field. The energy of the oscillator can be 
changed by an acting force or, alternatively, by a parametric influence, that is, 
when a parameter of the oscillator, for instance the length of a pendulum, is 
being changed. In the first case, the fundamental equation takes the form 

q + uj^q = f{t), (2) 

whereas in the second case we have 

q + Lo‘^{t)q = 0. (3) 

Eqs.(2) and (3) are profoundly different, both, mathematically and physically. 

Let us concentrate on the parametric influence. We consider a pendulum of 
length L oscillating in a constant gravitational field g. The unperturbed pen- 
dulum oscillates with the constant frequency lo = \fgjL. Fig. la illustrates the 
variation of the length of the pendulum L(t) by an external agent, shown by 
alternating arrows. Since L{t) varies, the frequency of the oscillator does also 
vary: uj{t) = ^yg/L{t). The variation L{t) does not need to be periodic, but can- 
not be too slow (adiabatic) if the result of the process is going to be significant. 
Otherwise, in the adiabatic regime of slow variations, the energy of the oscillator 
E and its frequency uj do change slowly, but E/uj remains constant, so one can 
say that the “number of quanta” E/hw in the oscillator remains fixed. In other 
words, for the creation of new “particles - excitations”, the characteristic time 
of the variation should be comparable with the period of the oscillator and the 
adiabatic behaviour should be violated. After some duration of the appropriate 
parametric influence, the pendulum will oscillate at the original frequency, but 
will have a significantly larger amplitude and energy than before. This is shown 
in Fig. lb. Obviously, the energy of the oscillator has been increased at the ex- 
pense of the external agent (pump field). For simplicity, we have considered a 
familiar case, when the length of the pendulum varies, while the gravitational 
acceleration g remains constant. Variation of g would represent a gravitational 
parametric influence and would even be in closer analogy to what we will study 
below. 

A classical oscillator must have a non-zero initial amplitude for the amplifi- 
cation mechanism to work. Otherwise, if the initial amplitude is zero, the final 
amplitude will also be zero. Indeed, imagine the pendulum strictly at rest, hang- 
ing stright down. Whatever the variation of its length is, it will not make the 
pendulum to oscillate and gain energy. In contrast, a quantum oscillator does 
not need to be excited from the very beginning. The oscillator can be initially 
in its quantum-mechanical vacuum state. The inevitable zero-point quantum 
oscillations are associated with the vacuum state energy One can imagine 
a pendulum hanging straight down, but fluctuating with a tiny amplitude deter- 
mined by the “half of the quantum in the mode” . In the classical picture, it is 
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Fig. 1. Parametric amplification, a) variation of the length of the pendulum, b) in- 
creased amplitude of oscillations. 



this tiny amplitude of quantum-mechanical origin that is being parametrically 
amplified. 

The Schrodinger evolution of a quantum oscillator depends crucially on whe- 
ther the oscillator is being excited parametrically or by a force. Consider the 
phase diagram {q,p), where q is the displacement and p is the conjugate mo- 
mentum. The vacuum state is described by the circle in the center (see Fig. 2). 
The mean values of q and p are zeros, but their variances (zero-point quantum 
fluctuations) are not zeros and are equal to each other. Their numerical values 
are represented by the circle in the center. Under the action of a force, the vac- 
uum state evolves into a coherent state. The mean values of p and q increased, 
but the variances are still equal and are described by the circle of the same 
size as for the vacuum state. On the other hand, under a parametric influence, 
the vacuum state evolves into a squeezed vacuum state. [For a recent review of 
squeezed states see [5], for example, and references therein.] Its variances for the 
conjugate variables q and p are significantly unequal and are described by an 
ellipse. As a function of time, the ellipse rotates with respect to the origin of 
the (PtP) diagram, and the numerical values of the variances oscillate, too. The 
mean numbers of quanta in the two states, one of which is coherent and another 
is squeezed vacuum, can be equal (similar to the coherent and squeezed states 
shown in Fig. 2) but the statistical properties of these states are significantly 
different. Among other things, the variance of the phase of the oscillator in a 
squeezed vacuum state is very small (squeezed). Graphically, this is reflected in 
the fact that the ellipse is very thin, so that that the uncertainty in the angle 
between the horizontal axis and the orientation of the ellipse is very small. This 
highly elongated ellipse can be regarded as a portrait of the gravitational wave 
quantum state that is being inevitably generated by parametric amplification, 
and which we will be dealing with below. 

A wave-field is not a single oscillator, it depends on spatial coordinates and 
on time and may have several independent components (polarization states). 
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Fig. 2. Some quantum states of a harmonic oscillator. 



However, the field can be decomposed into a set of spatial Fourier harmonics. In 
this way we represent the gravitational wave field as a collection of many modes, 
many oscillators. Because of the nonlinear character of the Einstein equations, 
each of these oscillators is coupled to the variable gravitational field of the sur- 
rounding Universe. For sufficiently short gravitational waves of experimental 
interest, this coupling was especially effective in the early Universe, when the 
condition of the adiabatic behaviour of the oscillator was violated. It is this 
homogeneous and isotropic gravitational field of all the matter in the early Uni- 
verse that played the role of the external agent - pump field. The variable pump 
field acts parametrically on the gravity-wave oscillators and drives them into 
multiparticle states. Concretely, the initial vacuum state of each pair of waves 
with oppositely directed momenta evolves into a highly correlated state known 
as the two-mode squeezed vacuum state [6,7]. The strength and duration of the 
effective coupling depends on the oscillator’s frequency. They all start in the 
vacuum state but get excited to various amounts. As a result, a broad spectrum 
of relic gravitational waves is being formed. This spectrum is accessible to our 
observations today. 

Let us formulate the problem in more detail. 



2 Cosmological Gravitational Waves 

In the framework of general relativity, a homogeneous isotropic gravitational 
field is decribed by the line element 

ds^ = c^dt^ — a^{t)5ijdx^dx^ = a^{r])[dr]^ — Sijdx^dx^]. (4) 

In cosmology, the function a{t) (or a(ry)) is called scale factor. In our discussion, 
it will represent the gravitational pump field. 
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Cosmological gravitational waves are small corrections hij to the metric ten- 
sor. They are defined by the expression 

ds^ = o^(r?)[d?7^ — {Sij + hij)dx^dx^]. (5) 

The functions hij{r], x) can be expanded over spatial Fourier harmonics and 
g-jnx^ where n is a constant wave vector. In this way, we reduce the dynam- 
ical problem to the evolution of time-dependent amplitudes for each mode n. 
Among six functions hij there are only two independent (polarization) compo- 
nents. This decomposition can be made, both, for real and for quantized field hij. 
In the quantum version, the functions hij are treated as quantum-mechanical 
operators. We will use the Heisenberg picture in which the time evolution is car- 
ried out by the operators while the quantum state is fixed. This picture is fully 
equivalent to the Schrodinger picture, discussed in the Introduction, in which 
the vacuum state evolves into a squeezed vacuum state while the operators are 
time independent. 

The Heisenberg operator for the quantized real field hij can be written as 

pOO 2 if ^ i * 

%( t 7 , x ) = , / Pij{n)—= L + hniv)e~"'^'" L ( 6 ) 

J_oo ^ v2n L 

where (7 is a constant which will be discussed later. The creation and annihilation 
operators satisfy the conditions [c„, c^] = (5s/sd^(n — m), Cn|0) = 0, where 
|0) (for each n and s) is the fixed initial vacuum state discussed below. The 
wave number n is related with the wave vector n by n = (Sijn^n-^)^'^^. The two 

S 

polarization tensors Pij{n) (s = 1,2) obey the conditions 

s s s s s s 

P^jji^ = 0, PijS^^ = 0, P^jP Pij{-n) = Ty(n). 

The time evolution, one and the same for all n belonging to a given n, is repre- 

S 

sented by the complex time-dependent function hn{v)- This evolution is dictated 
by the Einstein equations. The nonlinear nature of the Einstein equations leads 

S 

to the coupling of hniv) with the pump field a{rj). For every wave number n and 

S 

each polarization component s, the functions hn{v) have the form 

hniv) = (7) 

where Un{ri) and Vn{p) can be expressed in terms of the three real functions (the 
polarization index s is omitted): r„ - squeeze parameter, (/>„ - squeeze angle, 

- rotation angle, 

Un = e*®” cosh r„, 

The dynamical equations for Un{ri) and Vn{p) 

. dUn ■ o' » . dVn . a' » 

I—. =nUn + l — V„, I — — =nVn+l—Uj, 

drj a drj a 



(9) 
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lead to the dynamical equations governing the functions r„(? 7 ), (j)n{v)j ^n{v) [^]: 

r^= — cos2(/)„, (j)'^ = —n sin2^„coth 2r„, 9'^ = —n sin2(/>„tanh r„, 

( 10 ) 

where ' = d/dr;, and the evolution begins from r„ = 0. This value of r„ charac- 
terizes the initial vacuum state |0) which is defined long before the interaction 
with the pump field became effective, that is, long before the coupling term a' /a 
became comparable with n. The constant C should be taken as C = vTOtt ^pi 
where Ipi = is the Planck length. This particular value of the con- 

stant C guarantees the correct quantum normalization of the field: energy 
per each mode in the initial vacuum state. The dynamical equations and their 
solutions are identical for both polarization components s. 

Equations (9) can be translated into the more familiar form of the second- 

S s s 

order differential equation for the function M„(? 7 ) = Un{rf) + v^{rf) = a{rj)hn{il) 

m-- 

fj," + Hn ~ = 0 - ( 11 ) 

Clearly, this is the equation for a parametrically disturbed oscillator (compare 
with Eq. (3)). In absence of the gravitational parametric influence represented 
by the term a" / a, the frequency of the oscillator defined in terms of rj-time would 
be a constant: n. Whenever the term a" /a can be neglected, the general solution 
to Eq. (11) has the usual oscillatory form 

M„(r?) = (12) 

where the constants An, Bn are determined by the initial conditions. On the 
other hand, whenever the term a" /a is dominant, the general solution to Eq. 
(11) has the form 

d77 

finiv) = Cna + Dna (13) 

In fact, this approximate solution is valid as long as n is small in comparison 
with |a'/a|. This is more clearly seen from the equivalent form of Eq. (11) written 
in terms of the function hn{r]) [8]: 

h'n 2 — h'n hn = 0. (14) 

a 

For growing functions a(??), that is, in expanding universes, the second term 
in Eq.(13) is usually smaller than the first one (see below), so that, as long as 
n <C a' /a, the dominant solution is the growing function ^n{v) = Cna{r]), and 



hn = const. 



(15) 
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Equation (11) can be also looked at 
as a kind of the Schrodinger equation 
for a particle moving in presence of ^ 
the effective potential U{t]) = a" /a. In 
the situations that are normally con- 
sidered, the potential U{i]) has a bell- 
like shape and forms a barrier (see Fig. 

3). When a given mode n is outside 
the barrier, its amplitude is adia- 
batically decreasing with time: hn oc 
This is shown in Fig. 3 by oscil- 
lating lines with decreasing amplitudes 
of oscillations. The modes with suffi- 
ciently high frequencies do not interact 
with the barrier, they stay above the 
barrier. Their amplitudes hn behave 
adiabatically all the time. For these 
high-frequency modes, the initial vac- 
uum state (in the Schrodinger picture) Fig. 3. Effective potential U(rf}. 

remains the vacuum forever. On the 

other hand, the modes that interact with the barrier are subject to the supera- 
diabatic amplification. Under the barrier and as long as n < a' /a, the function 
hn stays constant instead of the adiabatic decrease. For these modes, the initial 
vacuum state evolves into a squeezed vacuum state. 

After having formulated the initial conditions, the present day behaviour of 
r„, (f>n, On (or, equivalently, the present day behaviour of hn) is essentially all 
we need to find. The mean number of particles in a two-mode squeezed state 
is 2 sinh^ r„ for each s. This number determines the mean square amplitude 
of the gravitational wave field. The time behaviour of the squeeze angle 4>n 
determines the time dependence of the correlation functions of the field. The 
amplification (that is, the growth of r„) governed by Eq. (10) is different for 
different wave numbers n. Therefore, the present day results depend on the 
present day frequency v (y = cn/27ra) measured in Hz. 

In cosmology, the function H = a/a = ca' /a? is the time-dependent Hubble 
parameter. The function I = c/H is the time-dependent Hubble radius. The 
time-dependent wavelength of the mode n is A = 2TTa/n. The wavelength A has 
this universal definition in all regimes. In contrast, the v defined as i/ = cn/2Tra 
has the usual meaning of a frequency of an oscillating process only in the short- 
wavelength (high-frequency) regime of the mode n, that is, in the regime where 
A <C Z. As we have seen above, the qualitative behaviour of solutions to Eqs. 
(11), (14) depends crucially on the comparative values of n and a' /a, or, in 
other words, on the comparative values of X{rj) and l{rj). This relationship is 
also crucial for solutions to Eq. (10) as we shall see now. 

In the short-wavelength regime, that is, during intervals of time when the 
wavelength X{ri) is shorter than the Hubble radius 1{t]) = a^ /a' , the term n in 
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(10) is dominant. The functions 0 n{rf) are </>„ = —n{r] + rjn), = (pn 

where ?7„ is a constant. The factor cos 2(/>„ is a quickly oscillating function of time, 
so the squeeze parameter r„ stays practically constant. This is the adiabatic 
regime for a given mode. 

In the opposite, long-wavelength regime, the term n can be neglected. The 
function is tan (pniv) ~ const / a? {rf) , and the squeeze angle quickly approaches 
one of the two values: (/)„ = 0 or = tt (analog of “phase bifurcation” [9]). The 
squeeze parameter rn{fj) grows with time according to 

TniVi) « , (16) 



where a* is the value of 0(77) at 77*, when the long- wavelength regime, for a given 
71, begins. The final amount of r„ is 



in 

a* 



(17) 



where a** is the value of 0(77) at 77**, when the long-wavelength regime and 
amplification come to the end. It is important to emphasize that it is not a 
“sudden transition” from one cosmological era to another that is responsible 
for amplification, but the entire interval of the long-wavelength (non-adiabatic) 
regime. 

After the end of amplification, the accumulated (and typically large) squeeze 
parameter r„ stays approximately constant. The mode is again in the adiabatic 

S S ^ 

regime. In course of the evolution, the complex functions u„{rj) + become 

s s* 

practically real, and one has hn{v) ~ hniv) ~ cos (pniji) ■ Every amplified 
mode n of the field (6) takes the form of a product of a function of time and a 
(random, operator-valued) function of spatial coordinates; the mode acquires a 
standing-wave pattern. The periodic dependence cos</>„(?7) will be further dis- 
cussed below. 

It is clearly seen from the fundamental equations (10), (11), (14) that the final 
results depend only on a(v/). Equations do not ask us the names of our favorite 
cosmological prejudices, they ask us about the pump field 0(77). Conversely, from 
the measured relic gravitational waves, we can deduce the behaviour of 0(77), 
which is essentially the purpose of detecting the relic gravitons. 



3 Cosmological Pump Field 

With the chosen initial conditions, the final numerical results for relic gravita- 
tional waves depend on the concrete behaviour of the pump field represented by 
the cosmological scale factor 0(77). We know a great deal about 0(77). We know 
that 0(77) behaves as 0(77) oc at the present matter-dominated stage. We know 
that this stage was preceeded by the radiation-dominated stage 0(77) oc 77. At 
these two stages of evolution the functions 0(77) are simple power-law functions 
of 77. What we do not know is the function a(rj) describing the initial stage of 
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expansion of the very early Universe, that is, before the era of primordial nu- 
cleosynthesis. It is convenient to parameterize 0(77) at this initial stage also by 
power-law functions of rj. First, this is a sufficiently broad class of functions, 
which, in addition, allows us to find exact solutions to our fundamental equa- 
tions. Second, it is known [ 1 ] that the pump fields 0(77) which have power-law 
dependence in terms of 77, produce gravitational waves with simple power-law 
spectra in terms of ly. These spectra are easy to analyze and discuss in the 
context of detection. 

We model cosmological expansion by several successive eras. Concretely, we 
take 0(77) at the initial stage of expansion (z-stage) as 

a{v) = lo\v\^^^, ( 18 ) 

where 77 grows from —00, and 1 -|- /3 < 0. We will show later how the available 
observational data constrain the parameters lo and j 3 . The z-stage lasts up to 
a certain 77 = 771, 771 < 0. To make our analysis more general, we assume that 
the z-stage was followed by some interval of the 0-stage (0 from Zeldovich). It is 
known that an interval of evolution governed by the most “stiff’ matter (effective 
equation of state p = e) advocated by Zeldovich, leads to a relative increase 
of gravitational wave amplitudes [1]. It is also known that the requirement of 
conistency of the graviton production with the observational restrictions does 
not allow the “stiff’ matter interval to be too much long [1], [10]. However, we 
want to investigate any interval of cosmological evolution that can be consistently 
included. In fact, the 0-stage of expansion that we include is quite general. It 
can be governed by a “stiffer than radiation” [11] matter, as well as by a “softer 
than radiation” matter. It can also be simply a part of the radiation-dominated 
era. Concretely, we take 0(77) at the interval of time from 771 to some rjg (0-stage) 
in the form 

0(77) = Zoa^(77 - 77p)^+^% (19) 

where I + (is >0. For the particular choice (is = 0, the 0-stage reduces to 
an interval of expansion governed by the radiation-dominated matter. Starting 
from rjs and up to 772 the Universe was governed by the radiation-dominated 
matter (e-stage). So, at this interval of evolution, we take the scale factor in the 
form 

a{T]) = lotteir] - Pe)- ( 20 ) 

And, finally, from 77 = 772 the expansion went over into the matter-dominated 
era (77z-stage): 

~ Vm) • (21) 

A link between the arbitrary constants participating in Eqs. (18) - (21) is pro- 
vided by the conditions of continuous joining of the functions 0(77) and a'{rj) at 
points of transitions 771, rjs, 772. 

We denote the present time by rjR (R from reception). This time is defined 
by the observationally known value of the present-day Hubble parameter H^rjn) 
and Hubble radius Ih = c/H{rjpj). For numerical estimates we will be using 
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Ih ~ 2 X 10^® cm. It is convenient to choose = 1> so that 0 ( 77 /}) = 21 h- 

The ratio 

a{riR)/a{m) = C2 

is believed to be around C 2 = 10"^ . We also denote 



aim) / aiVs) = Cs , a(77s)/a(77i) = Ci • 



With these definitions, all the constants participating in Eqs. (18) - (21) (except 
parameters [3 and (3s which should be chosen from other considerations) are 
being expressed in terms of Ih, C 2 > Cs> and Ci- For example, 



\Vi\ 



+ /^l 

2ClC.Ci^ 



The important constant lo is expressed as 



lo = hlH((^ CfCl 






( 22 ) 



where b = 2^+^/|l + Note that 6=1 for (3 = —2. [This expression for 

lo may help to relate formulas written here with the equivalent treatment [ 12 ] 
which was given in slightly different notations.] The sketch of the entire evolution 
0 ( 77 ) is given in Fig. 4. 

We work with the spatially-fiat models (4). At every instant of time, the 
energy density e(j]) of matter driving the evolution is related with the Hubble 
radius l{r]) by 



Ke{r]) = 



P{T,y 



(23) 



where k = SttG/c^. For the case of power-law scale factors 0(77) oc 77^+^, the 
effective pressure 77 ( 77 ) of the matter is related with the e{rj) by the effective 
equation of state 



1-/3 

^ 3(1 + /3)"- 



(24) 



a(ri) 




Fig. 4. Scale factor 0 ( 77 ). 
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For instance, p = 0 for /3 = 1, p = |e for /3 = 0, p = — e for /3 = —2, and so on. 
Each interval of the evolution (18)-(21) is governed by one of these equations of 
state. 

In principle, the function a{rj) could be even more complicated than the one 
that we consider. It could even include an interval of the early contraction, in- 
stead of expansion, leading to the “bounce” of the scale factor. In case of a 
decreasing a{r]) the gravitational-wave equation can still be analyzed and the 
amplification is still effective [1]. However, the Einstein equations for spatially- 
flat models do not permit a regular “bounce” of a{rj) (unless e vanishes at the 
moment of “bounce”). Possibly, a “bounce” solution can be realized in alter- 
native theories, such, for example, as string-motivated cosmologies [13]. For a 
recent discussion of spectral slopes of gravitational waves produced in “bounce” 
cosmologies, see [14]. 



4 Solving Gravitational Wave Equations 



The evolution of the scale factor a{ri) given by Eqs. (18) - (21) and sketched in 
Fig. 4 allows us to calculate the function a' /a. This function is sketched in Fig. 5. 
In all the theoretical generality, the left-hand-side of the barrier in Fig.5 could 
also consist of several pieces, but we do not consider this possibility here. The 
graph also shows the important wave numbers uh, u- 2 , Us, n\. The uh marks 
the wave whose today’s wavelength \{riR) = 27ra(?7_R ) /uh is equal to the today’s 
Hubble radius Ih- With our parametrization a^qn) = 21^, this wavenumber is 
Uh = 47 t. The ri 2 marks the wave whose wavelength X{rj 2 ) = 2Tra{r]2)/n2 at 
p = 772 is equal to the Hubble radius ^( 772 ) at 77 = 772 . Since X{r]R) / X{r] 2 ) = 
{n 2 /nH)[a{r]R)/a{r] 2 )] and l{r]ii)/l{r] 2 ) = [a{r]R)/a{r] 2 )][a{T]R)/a{T] 2 )]^^'^, this 
gives us n 2 /riH = [a( 7 ?fi)/a( 7 ? 2 )]^^^ = ■ Working out in a similar fashion 

other ratios, we find 



~ S2 > 

UH ri 2 



77i ^ 1 

^ . 

Us 



(25) 



Solutions to the gravitational wave equations exist for any a{rj). At intervals 
of power-law dependence 0 ( 77 ), solutions to Eq. (11) have simple form of the 
Bessel functions. We could have found piece-wise exact solutions to Eq. (11) and 
join them in the transition points. However, we will use a much simpler treatment 
which is sufficient for our purposes. We know that the squeeze parameter r„ 
stays constant in the short-wavelength regimes and grows according to Eq. (16) 
in the long-wavelength regime. All modes start in the vacuum state, that is, 
r„ = 0 initially. After the end of amplification, the accumulated value (17) stays 
constant up to today. To find today’s value of e”” we need to calculate the ratio 
a**( 7 r)/a*( 7 r). For every given n, the quantity a* is determined by the condition 
A( 77 *) = /(t 7 *), wheras a** is determined by the condition A(t 7 **) = ?(? 7 *,). 

Let us start from the mode n = ni. For this wave number we have a* = 
a** = 0 ( 771 ), and therefore = 0. The higher frequency modes n > rii (above 
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Fig. 5. The function (x j a for the scale factor from Fig. 4. 



the barrier in Fig. 5) have never been in the amplifying regime, so we can write 

e’"" = 1, n > ni. (26) 



Let us now consider the modes n in the interval rii > n > Ug- For a given 
n we need to know a*(n) and a^i^{n). Using Eq. (18) one finds a*(n)/o*(ni) = 
(ni/n)^+^, and using Eq. (19) one finds a«*(n) /a** (ris) = (ns/n)^+^T Therefore, 
one finds 

a**{n) _ g^,«(ns) \ 1+^-= f ri/\ 

at{n) a*(ni) \ nJ \ni J 

Since a»*(ris) = a(?7s), a*(ni) = a(? 7 i), and a{rjs) / a{r]i) = Ci = {ni/nsY^^\ we 
arrive at 

a^^{n) _ ( \ /5-/3« 

a*(n) \ni) 



Repeating this analysis for other intervals of the decreasing n, we come to the 
conclusion that 



e 



Tn 



e 



Tn 



e 



Tn 




ni > n> Ug, 




Ug > n> U 2 , 




U 2 > n > riH- 



(27) 



The mnemonic rule of constructing e’'" at successive intervals of decreasing n is 
simple. If the interval begins at Ux, one takes (n/ua;)^*”'^** and multiples with 
e’’"* , that is, with the previous interval’s value e’’" calculated at the end of that 
interval Ux- For the function a' /a that we are working with, the /?* is always (3, 
whereas the /?** takes the values (3g, 0, 1 at the successive intervals. 
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The modes with n < uh are still in the long-wavelength regime. For these 
modes, we should take a{rifi) instead of atn,{n). Combining with we find 



e 



Vn 




n < uh- 



(28) 



Formulas (26) - (28) give approximate values of r„ for all n. The factor e’’” is 
gi-n > 2 ^ £qj- ri < m, and e’’" 1 for n <C ui. This factor determines the mean 

square amplitude of the gravitational waves. 

The mean value of the field hij is zero at every moment of time rj and in 
every spatial point x: (0|/iij(r7,x)|0) = 0. The variance 

(0|%(?7,x)/i*^(?7,x)|0) = {h^) 



is not zero, and it determines the mean square amplitude of the generated field - 
the quantity of interest for the experiment. Taking the product of two expressions 
(6) one can show that 



{h^)=— nVk„(r? dn= h^{n,r])— . 

Jo 7^1 Jo n 

Using the representation (7), (8) in Eq. (29) one can also write 

fj2 



{h^) = 



p 2/^2 



ndn(cosh 2r„ + cos 2(j)„ sinh 2r„) . 



(29) 



(30) 



We can now consider the present era and use the fact that e’’" are large numbers 
for all n in the interval of our interest rii > n > uh- Then, we can derive 



h{n,T]) 



C 1 

TT a{f]R) 



ne ^ 



cos (j>n{ri) = 8v^ 





cos 4>n{'n) ■ 



(31) 



The quantity h(n, t]) is the dimensionless spectral amplitude of the field whose 
numerical value is determined by the calculated squeeze parameter r„. The os- 
cillatory factor cos (j)n{v) reflects the squeezing (standing wave pattern) acquired 
by modes with rii > n > uh- For modes with n < hr this factor is approxi- 
mately 1. For high-frequency modes n uh one has 4>n{v) ~ ri (?7 — rjn) ^ 1, 
so that h{n, rj) makes many oscillations while the scale factor a{rj) is practically 
fixed at a{r]R). 

The integral (30) extends formally from 0 to oo. Since r„ « 0 for n > ni, 
the integral diverges at the upper limit. This is a typical ultra-violet divergence. 
It should be discarded (renormalized to zero) because it comes from the modes 
which have always been in their vacuum state. At the lower limit, the integral 
diverges, if /3 < —2. This is an infra-red divergence which comes from the as- 
sumption that the amplification process has started from infinitely remote time 
in the past. One can deal with this divergence either by introducing a lower 
frequency cut-off (equivalent to the finite duration of the amplification) or by 
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considering only the parameters /3 > — 2 , in which case the integral is convergent 
at the lower limit. It appears that the available observational data (see below) 
favour this second option. The particular case f3 = —2 corresponds to the de 
Sitter evolution a{rj) oc \r]\~^. In this case, the h{n) found in Eqs. (31), (28) does 
not depend on n. This is known as the Harrison-Zeldvich, or scale-invariant, 
spectrum. 

An alternative derivation of the spectral amplitude h{n) uses the approximate 
solutions (12), (13) to the wave equation (11). This method gives exactly the 
same, as in Eqs. (31), (26) - (28) numerical values of h{n), but does not reproduce 
the oscillatory factor coscpniv)- 

One begins with the initial spectral amplitude hi{n) defined by quantum 
normalization: hi{n) = 8 y^(^Pi/Ai). This is the amplitude of the mode n at 
the moment 77 , of entering the long wavelength regime, i.e. when the mode’s 
wavelength Ai is equal to the Hubble radius ^( 77 *). For Ai one derives 



Thus, we have 



Aj — jlo 
0 




hi{n) = A 




where A denotes the constant 



A = 68 v^^ . 



(32) 



(33) 



(34) 



The numbers hi{n) are defined at the beginning of the long- wavelength regime. 
In other words, they are given along the left-hand-side slope of the barrier in 
Fig. 5. We want to know the final numbers (spectral amplitudes) h{n) which 
describe the field today, at t]r. 

According to the dominant solution hn(jj) = const of the long- wavelength 
regime (see Eq. (15)), the initial amplitude hi{n) stays practically constant up 
to the end of the long-wavelength regime at 77 **, that is, up to the right-hand- 
side slope of the barrier. [The second term in Eq. (13) could be important only 
at the z-stage and only for parameters (3g < —( 1 / 2 ), which correspond to the 
effective equations of state p > e. In order to keep the analysis simple, we do 
not consider those cases.] After the completion of the long-wavelength regime, 
the amplitudes decrease adiabatically in proportion to 1 / 0 ( 77 ), up to the present 
time. Thus, we have 



h{n) = A 



n \ a^^(nj 

nn J a{riR) ' 



(35) 



Let us start from the lower end of the spectrum, n < nn, and go upward 
in n. The modes n < uh have not started yet the adiabatic decrease of the 
amplitude, so we have 



h{n) = A 



n 

Uh 



2+/3 



n < Uh- 



(36) 
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Now consider the interval U 2 > n> uh- At this interval, the / a{rjpi) scales 

as inH/nY, so we have 



h{n) = A 



n 2 > n> Uh- 



\nHj 

At the interval rig > n > ri2 the ratio 

_ a^i,{n) 0(772) 
a{VR) a{r]2) a{-qn) 



scales as (712/77) (n///n2)^, so we have 



/ \ 1+/5 

= A M 



h{n) = A 



, Us > n > U 2 - 



\nH J 772 

Repeating the same analysis for the interval ni > n > Us we find 

\ 1+/3— /3s / ^ \ /3s 



h{n)=A(—\ 

\nH/ 



Ha.] He 

Uh J 772 



, 77i > 77 > 77s 



(37) 



(38) 



(39) 



It is seen from Eq. (39) that an interval of the 2-stage with /3s < 0 (the al- 
ready imposed restrictions require also (—1/2) < /3s) bends the spectrum h{n) 
upwards, as compared with Eq. (38), for larger n. If one recalls the relationship 
(22) between lo and Ih and uses (27), (28) in Eq. (31) one arrives exactly at 
Eqs.(36)-(39) up to the oscillating factor cos^„(?7). 

Different parts of the barrier in Fig. 5 are responsible for amplitudes and 
spectral slopes at different intervals of n. The sketch of the generated spectrum 
h{n) in conjunction with the form of the barrier is shown in Fig. 6. 

The present day frequency of the oscillating modes, measured in Hz, is defined 
as 77 = cn/2Tra{r]ji). The lowest frequency (Hubble frequency) is vh = c/Ih- For 
numerical estimates we will be using vh ~ 10“^® Hz. The ratios of n are equal 
to the ratios of v, so that 77/77/7 = v jvn, for example. For high-frequency modes 
we will now often use the ratios of v instead of ratios of n. 

In addition to the spectral amplitudes h{n) the generated field can be also 
characterized by the spectral energy density parameter fig(n). The energy den- 
sity tg of the gravitational wave field is 

The mean value (0|eg(?7, x)|0) is given by 



K{eg) = 



1 

4a^ 2tt‘^ 






d77. 



(40) 



S = 1 



For high-frequency modes, it is only the factor that needs to be different!- 

I 1 2 9 I 1 2 

ated by 77 . After avaraging out the oscillating factors, one gets \hn\ = n \hn\ , 
so that 






1 

4a^ 



^ 2,2. 
n h in) — . 
n 



( 41 ) 
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In fact, the high-frequency approximation, that has been used, permits integra- 
tion over lower n only up to nn- And the upper limit, as was discussed above, is 
in practice ni, not infinity. The parameter f2g is defined as fig = {eg)/e, where 
e is given by Eq. (23) (critical density). So, we derive 



and 




The dimensionless quantity 
I7g(i/) is useful because it allows 
us to quickly evaluate the cosmo- ^ 
logical importance of the gener- A -- 
ated field in a given frequency in- 
terval. However, the primary and 
more universal concept is h{v), ~~ 

not fig{v). It is the field, not j -- 
its energy density, that is di- 
rectly measured by the gravity- f) 
wave detector. One should also 
note that some authors use 
quite a misleading definition 
^gif) = {f/Pc)(dpg^/dlnf) 
which suggests differentiation of 

the gravity-wave energy density g_ Amplitudes and spectral slopes of h{n) 

by frequency. This would be in- g^j-g determined by different parts of the barrier 
correct and could cause disagree- a' /a. 
ments in numerical values of fig. 

Whenever we use I7g(i/), we mean relationship (42); and for order of magnitude 
estimates one can use [1]: 





fig{iy) 




(43) 



5 Theoretical and Observational Constraints 

The entire theoretical approach is based on the assumption that a weak quantized 
gravity-wave field interacts with a classical pump field. We should follow the 
validity of this approximation throughout the analysis. The pump field can be 
treated as a classical gravitational field as long as the driving energy density 
e is smaller than the Planck energy density, or, in other words, as long as the 
Hubble radius l{r]) is greater than the Planck length Ipi. This is a restriction 
on the pump field, but it can be used as a restriction on the wavelength of 
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the gravity-wave mode n at the time of entry the long-wavelength regime. If 
l^n*) > ^Pi; then \i > /pi. The Ai is given by Eq. (32). So, we need to ensure 
that 



b 



Ipi 




< 1 . 



At the lowest-frequency end v = vh this inequality gives b{lp\/lo) < 1. In fact, 
the observational constraints (see below) give a stronger restiction: 

b ^ « 10-6, (44) 

which we accept. Then, at the highest-frequency end p = we need to satisfy 

(t) ^ 



Let us now turn to the generated spectral amplitudes h{iy). According to 
Eq. (36) we have h{i'H) ~ ^8i/7t(?pi/?o)- The measured microwave beckgound 
anisotropies, which we discuss below, require this number to be at the level of 
IQ- 6 , which gives the already mentioned Eq. (44). The quantity h{vi) at the 
highest frequency v\ is given by Eq. (39): 



7 / \ 

h{vi) = b8^/n -p ( — ) 
lo J 




vh 

V2 ' 



Using Eq. (22) this expression for h{vi) can be rewritten as 

h(pi) = 8 v^ = 8 v^^, (46) 

IH Vh Ai 

where Ai = c/pi. This last expression for h(i>i) is not surprising: the modes 
with V > v\ are still in the vacuum state, so the numerical value of h{u\) is 
determined by quantum normalization. 

All the amplified modes have started with small initial amplitudes hi, at 
the level of zero-point quantum fluctuations. These amplitudes are also small 
today, since the hi could only stay constant or decrease. However, even these 
relatively small amplitudes should obey observational constraints. We do not 
want the fig in the high-frequency modes, which might affect the rate of the 
primordial nucleosynthesis, to exceed the level of lO”®. This means that flg{vi) 
cannot exceed the level of lO”® or so. The use of Eq. (42) in combination with 
I7g(pi) « IQ -6 and h{vi) from Eq. (46), gives us the highest allowed frequency 
Pi « 3 X 10^6 JJ 2 . We will use this value of pi in our numerical estimates. 
Returning with this value of to Eq. (45) we find that parameter [3 can only 
be (3 < —1.8. We will be treating /3 = — 1.8 as the upper limit for the allowed 
values of (3. 
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We can now check whether the accepted parameters leave room for the pos- 
tulated z-stage with fig < 0. Using Eq. (22) we can rewrite Eq. (44) in the 
form 



10 



-ehi_ 

lp\ 




(47) 



We know that v^jvH = 10^ and vijvg is not smaller than 1. Substituting all 
the numbers in Eq. (47) one can find that this equation cannot be satisfied for 
the largest possible j3 = —1.8. In the case [3 = —1.9, Eq. (47) is only marginally 
satisfied, in the sense that a significant deviation from fig = Q toward negative 
(3g can only last for a relatively short time. For instance, one can accomodate 
fig = —0.4 and Ug = 10® Hz. On the other hand, if one takes f) = —2, a somewhat 
longer interval of the 2 ;-stage with fig < 0 can be included. For instance, Eq. 
(47) is satisfied if one accepts Ug = lO”"* Hz and fig = —0.3. This allows us 
to slightly increase h(i') in the interval i^g < v < vi, as compared with the 
values of h{i^) reached in the more traditional case fi = —2, fig = 0. In what 
follows, we will consider consequences of this assumption for the prospects of 
detection of the produced gravitational wave signal. Finally, let us see what the 
available information on the microwave background anisotropies [15,16] allows 
us to conclude about the parameters fi and lo- 

Usually, cosmologists operate with the spectral index n (not to be confused 
with the wave number n) of primordial cosmological perturbations. Taking into 
account the way in which the spectral index n is defined, one can relate n with 
the spectral index fi + 2 that shows up in Eq.(36). The relationship between 
them is n = 2/3 -|- 5. This relationship is valid independently of the nature of 
cosmological perturbations. In particular, it is valid for density perturbations, 
in which case the h{n) of Eq.(36) is the dimensionless spectral amplitude of 
metric perturbations associated with density perturbations. If primordial gravi- 
tational waves and density perturbations were generated by the mechanism that 
we discuss here (the assumption that is likely to be true) than the parameter fi 
that participates in the spectral index is the same one that participates in the 
scale factor of Eq. (18). Primordial gravitational waves and primordial density 
perturbations with the same spectral index produce approximately the same 
lower-order multipole distributions of large-scale anisotropies. 

The evaluation of the spectral index n of primordial perturbations have re- 
sulted in n = 1.2 ±0.3 [16] or even in a somewhat higher value. A recent analysis 
[17] of all available data favors n = 1.2 and the quadrupole contribution of grav- 
itational waves twice as large as that of density perturbations. One can interpret 
these evaluations as indication that the true value of n lies somewhere near 
n = 1.2 (hopefully, the planned new observational missions will determine this 
index more accurately). This gives us the parameter fi somewhere near fi = —1.9. 
We will be using /3 = — 1.9 in our estimates below, as the observationally pre- 
ferred value. The parameter fi can be somewhat larger than fi = —1.9. However, 
as we already discussed, the value fi = —1.8 (n = 1.4) is the largest one for 
which the entire approch is well posed. The Harrison-Zeldovich spectral index 
n = 1 corresponds to /3 = — 2. 
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log h(v) 




logv, Hz 



Fig. 7. Expected spectrum h(v) for the case /3 = —1.9. 



The observed quadrupole anisotropy of the microwave background radia- 
tion is at the level 5T/T k, 10“®. The quadrupole anisotropy that would be 
produced by the spectrum (36) - (39) is mainly accounted for by the wave num- 
bers near nn- Thus, the numerical value of the quadrupole anisotropy produced 
by relic gravitational waves is approximately equal to A. According to general 
physical considerations and detailed calculations [18], the metric amplitudes of 
long-wavelength gravitational waves and density perturbations generated by the 
discussed amplification mechanism are of the same order of magnitude. There- 
fore, they contribute roughly equally to the anisotropy at lower multipoles. This 
gives us the estimate A k. 10“®, that we have already used in Eq. (44). It is not 
yet proven observationally that a significant part of the observed anosotropies 
at lower multipoles is indeed provided by relic gravitational waves, but we can 
at least assume this with some degree of confidence. It is likely that the future 
measurements of the microwave background radiation will help us to verify this 
theoretical conclusion. 

Combining all the evaluated parameters together, we show in Fig. 7 the 
expected spectrum of h{v) for the case [3 = —1.9. A small allowed interval of the 
z-stage is also included. The intervals of the spectrum accessible to space-based 
and ground-based interferometers are indicated by vertical lines. 

It is necessary to note [18,19] that the confirmation of any n > 1 (/3 > —2) 
would mean that the very early Universe was not driven by a scalar field - the 
cornerstone of inflationary considerations. This is because the n > 1 (/3 > —2) 
requires the effective equation of state at the initial stage of expansion to be 
e+p < 0 (see Eq. (24)), but this cannot be accomodated by any scalar field with 
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whichever scalar field potential. The available data do not prove yet that n > 1, 
but this possibility seems likely. 

It is also necessary to say that a certain damage to gravitational wave research 
was inflicted by the so called “standard inflationary result” . The “standard in- 
flationary result” predicts infinitely large amplitudes of density perturbations 
in the interval of spectrum with the Harrison-Zeldovich slope n = 1 (/3 = —2): 
Spj p oc \ — n. The metric (gravitational field) amplitudes of density pertur- 

bations are also predicted to be infinitely large, in the same proportion. Through 
the so-called “consistency relation” this divergence leads to the vanishingly small 
amplitudes of relic gravitational waves. Thus, the “standard” inflationary theory 
predicts zero for relic gravitational waves; the spectrum similar in shape to the 
one shown in Fig. 7 would have been shifted down by many orders of magnitude. 
This prediction is hanging on the “standard inflationary result” , but the “result” 
itself is in a severe conflict not only with theory but with observations too: when 
the observers marginalize their data to n = 1 (enforce this value of n in data anal- 
ysis) they find finite and small density perturbations instead of infinitely large 
perturbations predicted by inflationary theorists. (For analytical expressions of 
the “standard inflationary result” see any inflationary article, including recent 
reviews. For graphical illustration of the predicted divergent density perturba- 
tions and quadrupole anisotropies see [20], for example. For critical analysis and 
disagreement with the “standard inflationary result” see [18].) General relativity 
and quantum field theory do not produce the “standard inflationary result” , so 
we shall better return to what they say. 



6 Detectability of Relic Gravitational Waves 



We switch now from cosmology to prospects of detecting the predicted relic 
gravitational waves. The ground-based [21]-[23] and space-based [24], [25] laser 
interferometers (see also [26]-[28]) will be in the focus of our attention. We use 
laboratory frequencies v and intervals of laboratory time t (cdt = a^rjujdr]). 
Formulas (38) and (39), with A = 10“®, V 2 /vh = 10^, and the oscillating factor 
restored, can be written as 

/ jy \/3+i 

h{v, t) « 10“^ cos[27Ti^(t — G)] ( — ) , 1^2 < (48) 






and 

h{v, t) 



10 ^ cos[27Ti/(t — tj,)] 




< 1/ < JZl 



(49) 



where the deterministic (not random) constant G does not vary significantly 
from one frequency to another at the intervals Av « v. The explicit time depen- 
dence of the spectral variance h?{v, t) of the field, or, in other words, the explicit 
time dependence of the (zero-lag) temporal correlation function of the field at 
every given frequency, demonstrates that we are dealing with a non-stationary 
process (a consequence of squeezing and severe reduction of the phase uncer- 
tainty). We will first ignore the oscillating factor and will compare the predicted 
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amplitudes with the sensitivity curves of advanced detectors. The potential re- 
serve of improving the signal to noise ratio by expoloiting the squeezing will be 
discussed later. 

Let us start from the Laser Interferometer Space Antenna (LISA) [24] . The in- 
strument will be most sensitive in the interval, roughly, from 10“^ Hz to 10“^ Hz, 
and will be reasonably sensitive in a broader range, up to frequencies 10“^ Hz 
and 1 Hz. The sensitivity graph of LISA to a stochastic background is usually 
plotted under the assumption of a 1-year observation time, that is, the root- 
mean-square (r.m.s.) instrumental noise is being evaluated in frequency bins 
Ah' = 3 X 10“® Hz around each frequency v. We need to rescale our predicted 
amplitude h{v) to these bins. 

The mean square amplitude of the gravitational wave field is given by the 
integral (29). Thus, the r.m.s. amplitude in the band Av centered at a given 
frequency v is given by the expression 



h{v, Av) = h{u)s^ . (50) 

We use Eqs. (48), (49) and calculate expression (50) assuming Av = 3 x 10“® Hz. 
The results are plotted in Fig. 8. Formula (48) has been used throughout the 
covered frequency interval for the realistic case [3 = —1.9 and for the extreme 
case (3 = —1.8. The line marked z-model describes the signal produced in the 
composite model with (3 = —2 up to Vg = 10“^ Hz (formula (48)) and then 
followed by formula (49) with (3s = —0.3. This model gives the signal a factor 
of 3 higher at v = 10“^ Hz, than the model (3 = —2 extrapolated down to this 
frequency. 

There is no doubt that the signal (3 = —1.8 would be easily detectable even 
with a single instrument. The signal f3 = —1.9 is marginally detectable, with 
the signal to noise ratio around 3 or so, in a quite narrow frequency interval 
near and above the frequency 3 x 10“^ Hz. However, at lower frequencies one 
would need to be concerned with the possible gravitational wave noise from 
unresolved binary stars in our Galaxy. The further improvement of the expected 
LISA sensitivity by a factor of 3 may prove to be crucial for a confident detection 
of the predicted signal with /3 = — 1.9. 

Let us now turn to the ground-based interferometers operating in the interval 
from 10 Hz to 10^ Hz. The best sensitivity is reached in the band around v = 
10^ Hz. We take this frequency as the representative frequency for comparison 
with the predicted signal. We will work directly in terms of the dimensionless 
quantity h{v). If necessary, the r.m.s. amplitude per Hz^^^ at a given v can be 
found simply as h(y)j^v. The instrumental noise will also be quoted in terms 
of the dimensionless quantity hexiy)- 

The expected sensitivity of the initial instruments at jz = 10^ Hz is hex = 
10“^^ or better. The theoretical prediction at this frequency, following from (48), 
(49) with f3s = 0, is hth = 10“^^ for (3 = —1.8, and hth = 10“^® for P = —1.9. 
Therefore, the gap between the signal and noise levels is from 2 to 4 orders 
of magnitude. The expected sensitivity of the advanced interferometers, such as 




188 



L.P. Grishchuk 




Fig. 8. Expected spectrum j3 = —1.9 and other possible spectra in comparison with 
the LISA sensitivity. 



LIGO-II [29], can be as high as h^x = 10“^^. In this case, the gap vanishes for the 
(i = —1.8 signal and reduces to 2 orders of magnitude for the j3 = —1.9 signal. 
Fig. 9 illustrates the expected signal in comparison with the LIGO-II sensitivity. 
Since the signal lines are plotted in terms of h(y), the LISA sensitivity curve 
(shown for periodic sources) should be raised and adjusted in accordance with 
Fig. 8. 

A signal below noise can be detected if the outputs of two or more detectors 
can be cross correlated. (For the early esimates of detectability of relic gravita- 
tional waves see [30].) The cross correlation will be possible for ground-based 
interferometers, several of which are currently under construction. The gap be- 
tween the signal and the noise levels should be covered by a sufficiently long 
observation time r. The duration t depends on whether the signal has any tem- 
poral signature known in advance, or not. We start from the assumption that no 
temporal signatures are known in advance. In other words, we first ignore the 
squeezed nature of the relic background and work under the assumption that 
the squeezing cannot be exploited to our advantage. 

The response of an instrument to the incoming radiation is s{t) = 
where Fjj depends on the position and orientation of the instrument. Since the 

is a quantum-mechanical operator (see Eq. (6)) we need to calculate the 
mean value of a quadratic quantity. The mean value of the cross correlation of 
responses from two instruments ( 0 |si(t)s 2 (i)| 0 ) will involve the overlap reduction 
function [31]- [34], which we assume to be not much smaller than 1 [33]. The 
signal to noise ratio S/N in the measurement of the amplitude of a signal with 
no specific known features increases as (tj^)^/^, where v is some characteristic 
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Fig. 9. Full spectrum h{v) accessible to laser interferometers. 



central frequency. If the signal has features known in advance and exploited by 
the matched filtering technique, the S/N increases as 

We apply the guaranteed law to initial and advanced instruments at 

the representative frequency v = 10^ Hz. This law requires a reasonably short 
time T = 10® sec in order to improve the S'/fV in initial instruments by two orders 
of magnitude and to reach the level of the signal with extreme spectral index 
(3 = —1.8. The longer integration time or a better sensitivity will make the S/N 
larger than 1. In the case of a realistic spectral index (3 = —1.9 the remaining gap 
of 4 orders of magnitude can be covered by the combination of a significantly 
better sensitivity and a longer observation time (not necessarily in one non- 
interrupted run). The sensitivity of the advanced laser interferometers, such as 
LIGO II, at the level /lex = 10“^® and the same observation time r = 10® sec 
would be sufficient for reaching the level of the predicted signal with f3 = —1.9. 

An additional increase oi S/N can be achieved if the statistical properties 
of the signal can be properly exploited. Squeezing is automatically present at 
all frequencies from vh to v\ . The squeeze parameter r is larger in gravitational 
waves of cosmological scales, and possibly the periodic structure in Eq. (31) can 
be better revealed at those scales. However, we are interested here in frequencies 
accessible to ground based interferometers, say, in the interval 30 Hz — 100 Hz. 
If our intention were to monitor one given frequency v from the beginning of 
its oscillating regime and up till now, then, in order to avoid the destructive 
interference from neighbouring modes during all that time, the frequency res- 
olution of the instrument should have been increadibly narrow, of the order of 
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10“^® Hz. Certainly, this is not something what we can, or intend to do. Although 
the amplitudes of the waves have adiabatically decreased and their frequencies 
redshifted since the beginning of their oscillating regime, the general statistical 
properties of the discussed signal are essentially the same now as they were 10 
years after the Big Bang or will be 1 million years from now. 

The periodic structure (48) may survive at some level in the instrumental win- 
dow of sensitivity from Vrnin (minimal frequency) to Vmax (maximal frequency). 
The mean square value of the field in this window is 

p^ma.x H /y 1 r^max 

/ — = 10“^'^ — / cos^[27Ti/(t - . (51) 

Because of the strong dependence of the integrand on frequency, or 

the value of the integral (51) is determined by its lower limit. Apparently, the 
search through the data should be based on the periodic structure that may 
survive &t v = As an illustration, one can consider such a narrow interval 
Av = — i^min that the integral (51) can be approximated by the formula 



— 14 / ‘^min 



COS [27T^'min(t tjnin)] ■ 



Clearly, the correlation function is strictly periodic and its structure is known in 
advance, in contrast to other possible signals. This is a typical example of using 
the a priori information. Ideally, the gain in S/N can grow as (rr'min)^^^- This 
would significantly reduce the required observation time r. For a larger Av, even 
an intermediate gain between the guaranteed law (rv)^^'^ and the law {Tvf!^, 
adequate for the matched filtering technique, would help. This could potentially 
make the signal with /3 = — 1.9 measurable even by the initial laser interferome- 
ters. A straightforward application of (51) for exploiting the squeezing may not 
be possible, as argued in the recent study [35], but more sophisticated methods 
are not excluded. 

For frequency intervals covered by bar detectors and electromagnetic detec- 
tors, the results expected follow from the same formulas (48,49) and have been 
briefly discussed elsewhere [30,19]. 



7 Conclusion 

It would be strange, if the predicted signal at the level corresponding to /3 = — 1.9 
were not seen by the instruments capable of its detection. There are not so many 
cosmological assumptions involved in the derivation that could prove wrong, thus 
invalidating our predictions. On the other hand, it would be even more strange 
(and even more interesting) if the relic gravitational waves were detected at 
the level above the [3 = —1.8 line. This would mean that there is something 
fundamentally wrong in our basic cosmological premises. To summarise, it is 
quite possible that the detection of relic (squeezed) gravitational waves may be 
awaiting only the first generation of sensitive instruments and an appropriate 
data processing strategy. 
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Abstract. Modern formulations of equivalence principles provide the foundation for 
an efficient approach to understanding and organizing the structural features of grav- 
itation field theories. Since theories’ predictions reflect differences in their structures, 
principles of equivalence also support an efficient experimental strategy for testing grav- 
itation theories and for exploring the range of conceivable gravitation physics. These 
principles focus attention squarely on empirical consequences of the fundamental struc- 
tural differences that distinguish one gravitation theory from another. Interestingly, the 
variety of such consequences makes it possible to design and perform experiments that 
test equivalence principles stringently but do so in markedly different ways than the 
most familiar experimental tests. 



1 Equivalence Principles and the Structure 
of Gravitation Theories 

1.1 From the Weak to Einstein’s Eqnivalence Principle 

Since the time of the Renaissance observations indicating that bodies fall in a 
gravitational field in a way that is independent of their internal composition and 
structure have been considered remarkable. Clearly Newton thought so since he 
deemed it necessary to perform pendulum experiments to verify this property of 
freefall as precisely as he could before he published laws of motion and universal 
gravitation that predict it [1] . Einstein also found this property of freefall remark- 
able. The insight he gained by reflecting on it in his famous elevator Gedanken 
experiment [2] is communicated by what we now call the Einstein equivalence 
principle (EEP). 

Einstein noted that if all bodies fall in the same way in an external gravita- 
tional held, an observer in freefall will And that freely falling bodies in his or her 
neighborhood move with uniform velocities relative to him or her and that the 
physics of pure particle mechanics in that neighborhood is indistinguishable from 
mechanics in the absence of gravity. This led him to suggest that a freely falling 
observer might And all other nongravitational physics in his or her neighborhood 
to be indistinguishable from such physics in the absence of gravity. Einstein then 
proceeded to show that if this were true for electrodynamic physics, which was 
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all fundamental physics at the time, light propagating out of a gravitational 
potential well must suffer a redshift. 

The following modern formulation of the EEP expresses the possibility sug- 
gested by Einstein in 1907. It states that the outcome of any local, nongravi- 
tational test experiment is independent of the experimental apparatus’ velocity 
relative to the gravitational field and is independent of where and when in the 
gravitational field the experiment is performed. This captures Einstein’s sugges- 
tion because, in principle, local nongravitational test experiments can be per- 
formed in spacetime regions where gravity is negligible. The two conditions that 
the EEP imposes are referred to as local Lorentz and local position invariance, 
respectively. Note that a local experiment is one performed within a spacetime 
region so small that the experimental apparatus can detect no tidal effects. A test 
experiment is one performed with an apparatus having a mass so small that the 
apparatus can detect no effect of the perturbation it induces in the gravitational 
field. 

From the perspective of 1907, the EEP is a striking generalization from the 
observed equality of test body accelerations in a gravitational field, and, as a 
matter of history, the gravitational redshift was the first physical consequence to 
be derived on the assumption that this generalization is valid. Given this history, 
it is not surprising that refinements of experimental tests of the universality of 
freefall acceleration and of measurements of the gravitational redshift remain 
among the most widely recognized tests of the validity of the EEP. Efforts to 
refine both kinds of test continue today. Readers can refer to Lute Maleki’s 
article [3] in this volume for details on a proposed space-based variation on 
gravitational redshift measurements. 

Tests of the universality of free fall acceleration are often referred to as Eotvos 
experiments because of the classic torsion-balance version performed by Baron 
von Eotvos and collaborators early in this century [4]. As noted above, the pro- 
cess of refining such tests continues today. For example, the group of Adelberger 
at the University of Washington [5] recently reported results of a torsion-balance 
experiment that include the conclusion that the gravitational accelerations of 
beryllium and copper test bodies toward the Earth are equal to better than 2.5 
parts in 10^^. The ultimate refinement of tests of the equality of such accel- 
erations may well be represented by the space-based STEP experiment under 
development at Stanford University. Readers can refer to an article in this vol- 
ume for details on STEP [6]. 

Interest in such experiments remains high because our understanding of their 
significance as tests of the EEP has evolved significantly since 1907. The clear 
distinction Einstein made at that time between particle mechanics and other 
nongravitational physics, specifically electromagnetic physics, is no longer vi- 
able. We now understand how test bodies are composed of atoms and that they, 
in turn, are composites of the mass-energy of nucleons and electrons and of the 
electromagnetic, weak- and strong-interaction binding energies of these parti- 
cles. Consequently, the freefall acceleration of test bodies can be influenced by 
many, if not all, aspects of nongravitational physics in an external gravitational 
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field, and experiments which test the universality of such accelerations turn out 
to be more profound tests of the EEP than one could have realized in 1907. We 
will return to this point later in section 2.1. 

Set aside, for a moment, the issue of whether or not the EEP is valid and 
questions regarding the precision to which we may be able to establish exper- 
imentally that it is valid. Is it possible to formulate a gravitation field theory 
which predicts that it is valid? It took Einstein almost ten years from the time 
of his 1907 insight to establish that the answer to this question is yes. He did 
so by formulating general relativity [7]. Interested readers can refer to the re- 
cent review by Norton [8] for a discussion of the fascinating and in many ways 
still controversial history of Einstein’s development of general relativity and of 
others’ early attempts to understand the theory. This story is plagued by many 
formulations of the equivalence principle, confusion regarding the significance of 
coordinates and covariance and so on. We will not delve further into it here. 
Instead, we discuss the kind of analysis one must do to determine whether or 
not any given gravitation field theory predicts the validity of the EEP. 



1.2 Theoretical Contexts for Analyses of the EEP 

If an analysis reveals outcomes of some local nongravitational test experiment 
that depend on the velocity of the experimental apparatus relative to an external 
gravitational field or on where or when in that field the experiment is performed, 
it is clear that the underlying theory is nonmetric, that is, it violates the EEP. 
Significantly, such an analytical result also provides the basis for actual exper- 
iments that search for the specific preferred-frame or preferred-location effect 
revealed by the analysis. This approach has led to the development of many 
stringent new tests of the EEP that are quite different from familiar Eotvos ex- 
periments and gravitational redshift measurements. It has also clarified which 
structural features of gravitation field theories are constrained by experimental 
evidence that the EEP is valid to some level of precision. 

Lagrangian field theory provides a natural setting for a general discussion 
of gravitation field theories. However, we note that much of what follows can 
be discussed in terms of gravitation field equations and matter-field equations 
of motion. Indeed, a number of recent papers exploit this latter approach to 
consider modifications of the Maxwell equations caused by quantum gravity ef- 
fects. Papers by Gambini and Pullin [9] and by Ellis et al. [10] are examples. 
The paper of Haugan and Lammerzahl [11] begins the analysis of physical con- 
sequences of a broad range of conceivable Maxwell equation modifications of this 
kind and the consideration of experiments that could detect them or constrain 
their magnitudes. 

Returning to Lagrangian-based gravitation theories, we recall that each ad- 
mits a formulation via an action principle. 



S / £{tpg,'tpm) d'^x = 0. 



( 1 ) 




198 M.P. Haugan and C. Lammerzahl 



Here, ipg denotes dependence of the Lagrangian density on gravitational poten- 
tials and their derivatives, and denotes dependence on matter fields and their 
derivatives. Not that long ago, one would have restricted attention to theories 
in which field derivatives appear only in conventional “kinetic” terms in C, but 
attitudes have changed so that field theories tend to be viewed as effective rather 
than fundamental theories. Consequently, there is now a greater willingness to 
consider dependence on higher-order derivatives and the presence of derivative 
couplings between fields. Such things can make theories nonrenormalizable, but 
this is not the issue for effective theories that it is for fundamental ones. 

A theory’s Lagrangian density C can be split into a purely gravitational part 
and “nongravitational” remainder, C = Cg + Lng. The gravitational part Lg 
depends only on gravitational potentials and their derivatives. Its form specifies 
the dynamics of free gravitational fields in the theory. The nongravitational part 
Lng depends on gravitational potentials and their derivatives and on matter 
fields and their derivatives. Its form specifies the coupling between matter and 
gravity in the theory. Its form determines both how matter responds to gravity 
and how matter acts as a source of gravity. 

1.3 The Role of Locality 

The matter fields involved in a local, nongravitational test experiment do not 
perturb gravitational potentials to a degree that the experiment can detect. It 
follows that when using a theory to predict the outcome of such an experiment we 
can treat gravitational potentials as specified functions which represent the grav- 
itational environment generated by some source. They are external potentials. 
To predict the outcome of the experiment one needs only a theory’s equations 
which govern the evolution of matter fields in the relevant gravitational envi- 
ronment. We derive these gravitationally-modified equations of motion from the 
theory’s action principle (1) by considering variations of the matter fields while 
keeping the external gravitational potential functions fixed in the appropriate 
form. Consequently, the outcomes of experiments that directly test the EEP de- 
pend on the form of a theory’s nongravitational Lagrangian density Lng alone. 
Clearly, evidence that the EEP is valid to some precision can constrain only the 
manner in which matter couples to gravity. 

Since experiments that directly test the EEP are local as well as test ex- 
periments, their outcomes are insensitive to the global form of external gravi- 
tational potentials. It is sufficient to consider initial terms of the Taylor-series 
expansions of external potentials when using a theory’s action principle to pre- 
dict a local experiment’s outcome. The expansions should be made about an 
event inside the experimental apparatus during the course of the experiment. 
Keeping terms through first order is sufficient to predict the outcome of any 
local experiment. However, in some important cases it is sufficient to keep only 
zeroth-order terms. In particular, this can be the case when an experiment is 
completed quickly enough that the experimental apparatus can detect no effect 
of external potential time dependence and no effect of accelerations induced by 
external potential spatial dependence. This is generally true of realistic local 
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experiments which measure atomic transition frequencies, for example, and ex- 
plains why atomic clocks may generally be treated as realizations of ideal clocks 
in the sense defined by theories of relativity. 

To make the remainder of this discussion a bit more concrete focus on exper- 
iments involving such atomic transitions. To this point, we have established that 
a theory’s predictions of their outcomes follows from the form that its nongravi- 
tational Lagrangian density takes when values of the external gravitational 
potentials and, if there are derivative matter-gravity couplings, values of their 
derivatives at an event inside the experimental apparatus during the experi- 
ment are plugged in. The resulting Lagrangian density has no explicit depen- 
dence on the spacetime coordinates and involves only matter fields. It determines 
the gravitationally-modified equations of motion which govern the structure of 
atoms treated as local test bodies. Note, however, that if we are interested only 
in atomic transition frequencies, we need not deal with these equations. We can, 
instead, compute energies of atomic states directly. 

1.4 Relevant Observables 

An expression for such energies follows from the form of the Lagrangian den- 
sity introduced in the preceding paragraph because it is time independent. If 
natural coordinates are used in representing a theory’s action principle (1), the 
energy expression’s form will be that of the Standard Model Hamiltonian plus 
perturbating terms. In this context, natural coordinates are ones in which the 
form of the representation of the theory’s nongravitational Lagrangian density 
C„g reduces to the familiar representation of the Standard Model Lagrangian 
density as gravity is “turned off.” Schwarzschild coordinates provide a familiar 
example of natural coordinates in the context of general relativity and situations 
in the external gravitational potential is static and spherically symmetric. 

Accurate estimates of the energies of atomic states are easily computed when 
the gravitationally-modified Hamiltonian is a perturbed Standard Model Hamil- 
tonian. In general, the results depend on an atom’s velocity through and location 
in the external gravitational potentials. This is the case because the perturbing 
Hamiltonian terms reflect not only the form of the gravitation theory’s nongrav- 
itational Lagrangian density Cng but also the atom’s gravitational environment. 
This environment is represented by the values in the atom’s neighborhood of 
the external gravitational potentials and, if there are derivative matter-gravity 
couplings, the values of their derivatives. 

Despite their velocity and location dependence these computed atomic state 
energies may not represent preferred-frame or preferred-location effects that 
signal violation of the Einstein equivalence principle. Even when we are using 
natural coordinates they are merely coordinate energies. Only velocity or location 
dependence of an experimentally measured atomic state energy, or frequency of 
a transition between such states, would constitute a genuine preferred-frame or 
preferred-location effect. 

The distinction between coordinate energies or frequencies and measured en- 
ergies or frequencies is, in some respects, subtle. However, it is not difficult to 
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appreciate if one remembers that, fundamentally, any measurement is simply 
the comparison of a property of some system of interest to the corresponding 
property of a chosen standard system. Thus, a measurement of the frequency of 
a transition between some pair of atomic states is simply a comparison of its co- 
ordinate frequency to the coordinate frequency of a selected standard transition. 
For example, one can imagine locking a laser to the transition whose frequency 
is to be measured and a second laser to the frequency of the standard transition. 
The coordinate frequency of each laser can depend on velocity through or loca- 
tion in the external gravitational potential, but the relative or beat frequency 
between them, which is the measured frequency, may not. While it may seem far- 
fetched to imagine cases in which coordinate energies of all atomic states depend 
on velocity through or location in external gravitational potentials in precisely 
the same way, thus, causing such dependence to cancel from measured energies, 
this is precisely what metric theories of gravity like general relativity predict. 
The preferred-frame or preferred-location effects predicted by nonmetric theo- 
ries of gravity occur because their matter-gravity couplings distinguish between 
the contributions of rest-mass and of different types of interaction-energy to 
the energies of atomic states and, so, prevent such universal cancellations of 
coordinate effects. If such couplings are present, the observable frequency ratio 
for a pair of atomic clocks whose ticking rates are governed by different atomic 
transition and which move together through an external gravitational potential 
can depend on the clocks’ location in and velocity through the potential. In the 
limit of slow motion and weak gravitation this frequency ratio takes the form 



V2{x,v) 



4 + 
^2 



(2) 



where the a;* denote natural spatial coordinates that reduce to Cartesian ones 
as gravity is turned off, the u* denote corresponding components of the clocks’ 
coordinate velocity and W^{x) denotes the usual Newtonian gravitational po- 
tential tensor at the clocks’ location. The parameters and f3ij depend on 
the particular transitions controlling the atomic clock rates, except in metric 
theories of gravity which predict that they all vanish. Their tensor character 
reflects the fact that the orientation of the atoms whose transitions govern the 
atomic clock rates can affect the observed frequency ratio (2) . Nonvanishing 
and f3ij parameters characterize preferred-frame and preferred-location effects, 
respectively. 

To conclude this discussion of the way in which Lagrangian-based gravitation 
theories predict outcomes of local, nongravitational test experiments, briefly con- 
sider the acceleration of test bodies in an external gravitational field once more. 
We focus on the freefall of atoms since, as we noted earlier, realistic test bodies 
are simply assemblages of them. 

The analysis of atomic systems outlined above yields an expression for the 
coordinate energy of any atom in any state of interest. This energy is a function of 
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the atom’s velocity relative to and location in an external gravitational potential 

E = m<? + -TO f 6ij + \ \ ^ (' 3 ^ 

2 \ TO / \ TO / 

where Smuj and Srrigij are the anomalous inertial and gravitational mass tensors. 
They depend on the particular state of the particular atom under consideration, 
except in metric theories of gravity which predict that they vanish. The aij and 
f3ij parameters appearing in ( 2 ) are determined by the anomalous mass tensors 
of the states involved in the atomic clock transitions considered above. 

When the external gravitational potential is time independent, the coordinate 
energy function (3) is globally conserved and its dependence on atomic velocity 
and location determines the atom’s coordinate acceleration via familiar energy 
conservation arguments, 

a* = dj U + dj U + 6^^ djU'^‘{x), (4) 

TO TO 

(here Sml^ = Smaj). Kenneth Nordtvedt [12] and Mark Haugan [13] exploit 
such arguments to relate the outcomes Eotvos experiments to the outcomes of 
gravitational redshift measurements and other tests of the EEP. 

In the end, the preceding overview of the kind of analysis one must do to 
determine the outcomes of local nongravitational test experiments predicted by 
Lagrangian-based gravitation field theories brings one full circle. We have come 
back to the most familiar experimental tests of the EEP, but with a deeper 
appreciation of the significance of their results. In the next section we consider 
examples of nonmetric theories and formalisms encompassing whole classes of 
such theories within which preferred-frame and preferred-location effects have 
been analyzed to provide a basis for testing the EEP. Much of the work on 
tests of the EEP done before 1993 is thoroughly reviewed in the early chapters 
of Clifford Will’s Theory and Experiment in Gravitation Physics [14], for an 
update see [15], see also [16]. 

2 Theoretical Frameworks for the Analysis of EEP Tests 

The approach outlined in the preceding section can be used to determine prefer- 
red-frame and preferred-location effects predicted by any nonmetric theory of 
gravity. Such effects reflect the form of the theory’s nongravitational Lagrangian 
density once the external gravitational potential in which a local nongrav- 
itational test experiment is performed has been plugged in. Proceeding in this 
fashion, we would have to analyze and reanalyze any given experiment to deter- 
mine its outcome as predicted by competing theories. 

It is, instead, more efficient to analyze local nongravitational test experi- 
ments once and for all within the context of more general theoretical frameworks 
that encompass broad classes of nonmetric gravitation theories and gravitational 
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environments. Such frameworks are based on models of the nongravitational La- 
grangian or corresponding matter field equations that depend on phenomenolog- 
ical gravitational potentials in ways that encompass the forms of these structures 
in many nonmetric theories and environments. The outcome of an experiment 
predicted within such a framework immediately yields the outcome predicted by 
any nonmetric theory it encompasses when the framework’s phenomenological 
gravitational potentials are expressed in terms of the particular theory’s poten- 
tials. The efficiency of this approach is somewhat like that provided by the PPN 
formalism [14] when dealing with the gravitational dynamics of metric theories. 

Analyses carried out in the general nonmetric frameworks discussed below 
have the additional benefit of identifying mechanisms that lead to preferred- 
frame or preferred-location effects in entire classes of nonmetric theories and of 
providing theory-independent parametrizations of such effects that are useful in 
discussing the results of experiments designed to search for them. 

2.1 The TiTe/x Formalism 

The Tile/x-formalism (see [17,14]) is based on the form of the Lagrangian gov- 
erning the dynamics of point particles with mass nii and charge qi and of the 
electromagnetic field in a static, spherically symmetric background gravitational 
field described by the phenomenological gravitational potentials T, H, e and qi: 

L = —'^^mi J \!t — Hxf dt + qi J Aaii dx°' + — J (eE^ 

i i V M / 

( 5 ) 

A striking feature of this framework and the nonmetric theories it encompasses is 
that the limiting speed of massive particles in the neighborhood of some point in 
the gravitational field can differ from the speed of light there. These coordinate 
speeds are given, respectively, by the values of \/T/H and \j at the point 
of interest. Preferred-frame effects result when the ratio of these speeds is not 
unity. Variation of the relative values of T, iJ, e and ^ with position in the 
gravitational field can also lead to preferred-location effects. Computations of 
the energies of atomic states using natural quantum mechanical extensions of the 
classical THefi Lagrangian reveal both kinds of effects and yield predictions for 
anomalous inertial and gravitational mass tensors (compare Eq.(3)) [13,18,19]. 
A quantum field theoretic extension of the formalism reveals EEP violations 
discernable in measurements of the Lamb shift, the anomalous magnetic moment 
of the electron and related phenomena [21]. 

This test theory has been widely used to interpret the results of experimental 
tests of the EEP. For example, its predictions of the energies of atomic states 
[13], [18] and [19] have be used to interpret Hughes-Drever type experiments as 
well as the Vessot-Levine rocket redshift experiment [20,14]. 

Originally conceived as a framework for analyzing the physics of charged 
particles and electromagnetic fields in an external gravitational field, the THeqc- 
formalism has also been extended in a natural way to cover the other sectors of 
nongravitational physics comprising the Standard Model [22]. 
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2.2 The xgr Formalism 

Like the TiJe/r-formalism the x^-formalism introduced by W.-T. Ni [23] origi- 
nally provided a framework for the analysis of electrodynamic physics in a back- 
ground gravitational field and has subsequently been extended to cover other 
sectors of the Standard Model. Unlike the TiLe/x-formalism the y^-formalism is 
not restricted to static, spherically symmetric gravitational environments. The 
X of its name refers to a tensor field appearing in the electromagnetic part of 
the nongravitational Lagrangian density upon which the formalism is based, 

Am = (6) 

The independent components of this tensor comprise twenty-one phenomeno- 
logical gravitational potentials capable of representing gravitational fields in a 
very broad class of nonmetric gravitation theories. 

The coupling of one particular phenomenological potential to the electro- 
magnetic field is interesting because it can be expressed as a purely derivative 
coupling to a pseudoscalar field A particle physicist would describe it as an 
axion coupling. A relativist would describe it as a coupling to axial torsion [24]. 
The Hojmann-Rosenbaum-Ryan-Shepley theory [25] is but one example of a 
theory encompassed by Ni’s yg-formalism. It features a novel torsion coupling 
that has been shown to predict effects inconsistent with the results of experi- 
mental tests of the weak equivalence principle. 

2.3 The Kostelecky Formalism 

String theory has the potential to provide a quantum theory of gravity that 
is unified with other fundamental theories of matter and interactions. Recently 
Colladay and Kostelecky have introduced a framework for treating the possi- 
bility of spontaneous breakdown of Lorentz symmetry in the context of string 
theory [31,32]. While somewhat different from the sources of preferred-frame 
effects considered to this point, these string induced effects are considered here 
because they lead to modifications of the Dirac and Maxwell equations like those 
considered in the next subsection. 



2.4 Formalisms Based on Matter Field Equations of Motion 

The effects an external gravitational field on the dynamics of matter fields can 
be dealt with at the level of equations of motion rather than Lagrangians. A 
formalism based on forms of the equations of motion has the advantage of directly 
addressing the following natural requirements one would demand of the dynamics 
of any quantum field (i) deterministic evolution, (ii) the superposition principle, 
(iii) a finite propagation speed (whose maximum value, since it need not be 
isotropic, we call cd ) and (iv) the conservation of probability. 
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The equations governing the motion of a Dirac field which satisfy these re- 
quirements are a first-order hyperbolic system of the form 



0 = + M(f , 

or, in 3 -|- 1-form (/t = 1, 2, 3) 

id()(p = coct^cdpip + cdFip + 



( 7 ) 

(8) 



which we call a generalized Dirac equation (5^ = ( 7 °)“^ 7 ^)- The matrices 
are not assumed to define a Clifford algebra, instead they satisfy 7 ^ 7 “^ -|- 7 *^ 7 '^ = 
where = jtr( 7 ^ 7 '^) and X^'' is a matrix. In general, M is also a 
matrix. A distinctive feature of this generalized Dirac equation is that it predicts 
a splitting of the null cones and mass shells. (For another modification of the 
Dirac equation see [26] in this volume.) 

Taking the non-relativistic limit and specifying a general position-depen- 
dence of the matrices 7 ^ and M, one derives the generalized Pauli equation [27] 



. d 1 / i ■ (5mF 



m 



didjip + 





uHdiip 



(9) 



mU{x) + C ■ crmU{x) + SnigijU^^ (x) + cdT ■ cr + mc^B ■ 






where the anomalous coefficients i5toF, SmU, A*, a*-, C, 5mW B stem from those 
parts of the 7 -matrices which prevent them from defining a Clifford algebra and 

Sfhy 

from anomalous terms in the mass matrix M, for example, — I — = 

\{1 + where (3 is the usual Dirac 7 ° and cr* are the usual Pauli matrices. 

The generalized Pauli equation predicts preferred-frame and preferred-location 
effects. Terms like those representing couplings between spin and the Newtonian 
gravitational potential were first introduced in references [28,29] and [30]. 

The Pauli equation (9) is a generalization of Schroedinger equation provided 
by M . Haugan’s approach [13] to the dynamics of scalar matter. As in the case 
of preceding formalisms, this Pauli equation provides a basis for broad range 
of experimental tests of the EEP, including experiments exploiting matter-wave 
interferometry. The classical limit of the generalized Pauli equation describes the 
free fall of classical spin-polarized bodies. 



a* = S^^djU + 



6rn3 



+ 2 



Smll 






d,U 



.Sv 



6m, 



gkl 



djU^\x). 



( 10 ) 



Notice that not all of the anomalous parameters appearing in the quantum 
equation (9) survive in the classical freefall acceleration. Only by considering 
the evolution of the spin as well can one design experiments in the classical limit 
that are sensitive to all possible anomalies. 

Once a generalized Dirac equation ( 8 ) is available we can address the dynam- 
ics of the electromagnetic field in an analogous way. The electromagnetic field 
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can be defined operationally by considering the phase shifts in charged particle 
interferometry. Assuming that the dynamics of electromagnetic fields satisfies 
the same requirements as we demanded for the dynamics of the Dirac field, this 
leads to generalized Maxwell equations of the form, 

di^F^p] = 0 , 4TTf = + X^P^Fp, . ( 11 ) 

In the case of small deviations from minimal coupling to the Riemannian space- 
time metric we have with small values of 

and XPP'^ . Clearly, SX^''^'^ can induce anistropic propagation of light and bire- 
fringence. The X^P'^ can also modify propagation, in some cases leading to a 
damping of electromagnetic waves. 

The generalized Dirac equation (8) and Maxwell equations (11) can be used 
just as the corresponding equations that emerge from the Ti/e/i-formalism or 
the yg-formalism, respectively, to analyze to properties of atoms in background 
gravitational fields. They do, however, encompass a wider range of nonmetric 
couplings that influence spin and polarization. Consequently, they provide the 
broadest possible basis for the interpretation of experimental tests of the EEP. 

3 Motivations for Continued Testing of the EEP 

Although all tests of the EEP, including some of remarkable precision, have 
so far failed to detect any hint of a violation, recent theoretical developments 
continue to suggest the EEP must be violated at some level. All approaches to 
quantizing gravity and to unifying it with the other fundamental interactions 
currently under study are capable of predicting such violations. 

3.1 String Theory 

Today, string theory is among the most promising candidate for a theory of 
quantum gravity fully unified with other fundamental interactions, and it has 
been shown to predict a variety of EEP violations. 

For example, departures from universal free fall accelerations have been com- 
puted in references [33,34]. The composition-dependent component of test-body 
acceleration is estimated to be as large as a part in 10^® of the mean gravita- 
tional acceleration. The proposed STEP experiment could easily detect such an 
anomaly (see [6]). 

String theory can also predict a time-varying fine structure constant because 
of couplings to scalar (dilatonic) fields, for example, see [35]. 

In the latest versions of string theory, physical particles and fields are confined 
to the neighborhoods of D-branes and their propagation may be affected by 
recoil of the branes caused by that propagation [36]. The effect of this recoil 
can be accounted for via an energy-dependent effective metric. This leads to 
modifications of the Maxwell equations, 

V-E + u-dtE = 0 V-B = 0 (12) 

V X B-{l-u^)dtE + uxdtB + {u-V)E = 0 V x E = -dtB , (13) 
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Fig. 1. Sources of violations of the EEP and classes of experiments or observations 
that are sensitive to them. 



which predict dispersive light propagation. Analogous modifications of the Dirac 
equation account for the effect of brane recoil on the propagation of neutrinos 
and other fermions [37], 

■ V)^/> = 0 . (14) 



3.2 Loop Quantum Gravity 

In the nonperturbative approach to quantum gravity based on observables anal- 
ogous to Wilson loops, the semi-classical gravitational field is described via 
expectation values in so-called “weave” states. Gambini and Pullin [9] discuss 
the propagation of light through a gravitational field represented by a parity- 
violating weave state and find a polarization dependence of light propagation 
inconsistent with the EEP. The weave state is characterized by the length scale 
L, and gives rise to effective Maxwell equations [9] of the form 

dtE ^ X B + 2xepV^B (15) 

dtB^V xE-2x£pV^E. (16) 

A corresponding effective Dirac equation [38] has the form 

{iPf‘^da-m + Y'’dadb)^ = 0 (17) 

with 7“ = 7“-hKi^G?-hK2 G^ + . . . , TO = TO-hAi^A/i-hAa M2 + . . . , 

and 7“'' = /xi ^Gf’+iJL2 (x) . . , where 7“ are the usual Dirac 7-matrices, 
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m is the usual mass of the Dirac particle, G“, Mi, and Gf^ are arbitrary matrices, 
fp is the Planck length and the coefficients Ki, Xi and fii are of the order unity 

(t=l,2,...). 

Since these field equations feature second-order spatial derivatives they are 
no longer hyperbolic and clearly single out a preferred frame. In addition, note 
that equation (16) modifies the homogeneous Maxwell equations, disrupting the 
relationship between field and 4-vector potential! 

3.3 Gauge Theories of Gravity and Other Possibilities 

Gauge theories of gravity like the Poincare gauge theory [39] that leads to a 
Riemann-Cartan geometry, or the gauge theory of a linear group that leads to a 
metric-affine theory [40], gives rise to additional gravitational fields like torsion 
and, in the latter case, to nonmetricity. If these additional fields couple directly 
to matter, they can break local Lorentz invariance by singling out a preferred 
frame as well as breaking local position invariance. 

In supergravity theories, which gauge the super-Poincare group, torsion 
emerges as a bilinear combination of fundamental spin-|-field, see reference [41], 
for example. 

Though invented long ago, Kaluza-Klein theories arise as a low-energy limits 
of string theory, with all that that entails regarding the validity of the EEP, see 
reference [42], for example. 

Finally, we note that nonsymmetric theories of gravity, like those devised by 
John Moffat, have been shown to predict departures from universal free fall and 
violations of local Lorentz invariance in the electromagnetic sector [43] and [44] . 

4 Experimental and Observational Tests of the EEP 

In principle, the outcomes of almost any experiment of observation conducted 
in different gravitational environments could yield evidence of the breakdown of 
the EEP. There are, however, certain classes of experiments and observations 
that are sensitive to characteristic violations of the EEP revealed by analyses 
within the theoretical frameworks discussed in section 2. 

4.1 Tests of the Universality of Freefall 

Tests with bulk matter. Experiments that search for composition-dependence 
of the freefall acceleration of macroscopic samples of matter are direct tests of 
the weak equivalence prinicple, one consequence of the EEP. It can be tested in 
traditional Eotvos fashion using torsion balance technology as in reference [5] or 
by monitoring the relative motion of freely falling bodies as in the Bremen drop 
tower experiment [45] and in the proposed MICROSCOPE [46] and STEP [6] 
space-based experiments. To date, the highest precision, of order 10“^^, has been 
achieved by torsion balance experiments, but the STEP experiment is designed 
to reach a precision of 10“^®. 
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Equation (10) shows that macroscopic samples of spin-polarized matter may 
experience different gravitational accelerations than unpolarized matter. Tor- 
sion balance experiments looking for such differential accelerations have been 
conducted by by Ritter, Gillies and coworkers [47,48]. They find no evidence of 
new spin-dependent forces. 



Tests with quantum particles. As we saw in section 2, nonmetric theories of 
gravity can predict that quanta of different kinds fall with different accelerations 
in a gravitational field. Historically, there has been a great deal of interest in 
direct searches for such effects, especially in comparing the free fall acceleration 
of particles and antiparticles. 

The first test of this type was performed by Witteborn and Fairbank [49] 
who tried to measure the gravitational acceleration of charged particles. A little 
later, Koester [50] showed that neutrons fall with the same way as classical bulk 
matter to an accuracy of a few percent. This result has been verified by means 
of neutron interferometry [51]. 

The potential for future matter interferometry tests of the EEP seems bright. 
Atomic interferometers have recently determined the gravitational acceleration 
toward the Earth to a part in 10® and yield results consistent with the mea- 
sured acceleration of bulk matter. Refinements of these devices are expected to 
produce still more precise results and can be used to search for spin-dependent 
accelerations like those in (10). 

4.2 Spectroscopic and Atomic Clock Tests of the EEP 

As noted in preceding sections atoms are composites of the mass-energy of nu- 
cleons and electrons as well as of their electromagnetic and weak- and strong- 
interaction binding energies. Nonmetric theories whose matter-gravity couplings 
distinguish between these various contributions can cause not only gravitational 
accelerations that differ from atom to atom but also shifts in the energy spac- 
ings of atomic states that depend on an atom’s velocity through or location in 
its gravitational environment. Spectroscopic and atomic clock experiments can 
search directly for these kinds of preferred-frame and preferred-location effects. 

In Hughes-Drever-like experiments, for example, see Ref. [52], one searches 
for relative shifts between the frequencies of ground-state hyperfine transitions 
depending on atomic orientation in the gravitational environment. The inter- 
pretation of this type of experiment in the context of the TiJey^-formalism is 
discussed in reference [14]. The interpretation in the context of the test theory 
of section 2.4 is discussed in reference [27]. 

Spectroscopic methods can also be used to search for the effects of spin- 
dependent and other EEP-violating effects predicted by equation (9), for ex- 
ample, see [53,27]. These techniques have been used to verify the spin-rotation 
coupling [54,55] in a search for anomalous spin-couplings [56]. 

Atomic clock technology is a particular, refined application of spectroscopic 
technique. Experiments that monitor the relative rates of different types of 
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atomic clock for dependence on the clocks’ velocity through or location in a 
gravitational field provide another kind of spectroscopic test of the EEP. In 
essence, such tests are either tests of relativistic Doppler shifts that are sensitive 
to the parameters in (2) or tests of gravitational redshifts that are sensitive 
to the parameters (3ij. The difficulty of moving clocks through large changes in 
gravitational potential or at speeds approaching that of light limits the precision 
of such experiments. However, the Gravity Probe A experiment [57] succeeded 
in imposing the constraint \(3ij\ < 10“^. A recent experiment employing trapped 
Lithium atoms moving at 6.4% of the speed of light [58] was able to impose the 
constraint \aij\ < 10“®. Atomic clock technology has also been used to constrain 
EEP-violating time-dependence of the fine-structure constant [59] . 



4.3 EEP Tests Involving Observations of Wave Propagation 



Observations of the propagation of electromagnetic waves or other fields through 
a gravitational field are, in sense, a kind of experiment examing the effects of 
freefall. We discuss them separately, however, because of the distinctive way in 
which local effects that we think of as directly signalling violations of the EEP 
are allowed to build up as waves propagate over long distances. 

Departures of the form of the Maxwell equations from their usual metric 
form induced by nonmetric couplings to gravity can lead to dispersive wave 
propagation or birefringence. Analogous departures of the Dirac equation from 
its usual metric form can also lead to dispersive propagation and make mea- 
surements of the arrival times of photons and neutrinos emitted from the same 
astrophysical event a test of EEP. The existence of very short duration events 
like supernova explosions and gamma ray bursts in combinations with the build 
up of gravitational delays over very great distances makes sharp tests possible. 

Recently, limits on gravity-induced dispersion of electromagnetic wave prop- 
agation have been inferred from observations of quasars and gamma ray bursters 
[60]. They constrain _ 

Cj(lo) — Cj{lOo) / 10 ^ 

f \ 1 4 - ) 1 -^°/ 



where Cj(ujo) is the velocity of the photon for a given frequency w or po- 
larization ±. Exploiting rapid time variation of gamma ray bursters, Schae- 
fer [60] is able to to impose sharp constraints on gravity-induced dispersion, 
|(c.y(a;) — Cj(uJo))/c-y(uJo)l < 6 x 10“^^ for uj ~ 10^® Hz and wq 10^® Hz. See 
also [62] for implications of such data for quantum gravity models. Data con- 
straining gravity-induced differences between the speed at which photons and 
neutrinos propagate are also imposed. 



- c${uj) 
% (^) 



(19) 



is also available. 

Finally, observations that constrain differences between the speeds with which 
light with different polarizations propagates through a gravitational field have 
been analyzed [61] leading to the constraint |(c+ — c“)/c+| < 10“^®. 
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Abstract. This paper presents an overview of the current technical status of STEP, 
the Satellite Test of the Equivalence Principle. STEP was originally presented as a 
candidate for ESA’s M2 mission as a joint mission with NASA, and has since been 
studied as an M3 candidate, and under NASA as QuickSTEP and MiniSTEP. Studies 
especially during the last two years have resolved some long standing issues such as 
control of helium tide, improved the mission definition and error analysis, and have 
resulted in an improved baseline design which should be capable of comparing rates of 
fall to an accuracy approaching 



1 Background 

When two bodies are allowed to fall freely in a uniform gravitational field, they 
fall with the same acceleration regardless of their composition. This perplexing 
observation, known as the Universality of Free Fall, is both consequence and 
proof of the Equivalence Principle (EP), the fundamental concept that inertia 
and gravitational mass are the same physical property. This principle has been 
confirmed in both ground-based and lunar-ranging experiments with an accu- 
racy of about a part in 10^^. The Satellite Test of the Equivalence Principle 
(STEP) is intended to test the EP in earth orbit to an accuracy of 1 part in 
10^®. Rarely is it possible in science to advance in one leap the testing of an 
already well-established principle by six orders of magnitude, and do so in a 
range that is scientifically significant. 

The Universality of Free Fall goes back in essence to Galileo’s observation 
that two bodies of different composition dropped from the Leaning Tower of Pisa 
fall with the same acceleration. The deep significance of this discovery was first 
demonstrated by Newton. He distinguished two properties of a body, its ’weight’ 
and the ’quantity of matter’ in it, or as we would say, its gravitational mass mg 
and inertial mass Wj, and he demonstrated that Universality of Free Fall makes 
the ratio mg /mi identical for all bodies regardless of composition. Even more 
importantly, he emphasized that it is exactly this property that marks off gravity 
from all the other forces in nature. Specifically, he pointed out that nothing like 
it applies to magnetism. 

The term EP was coined by Einstein in 1907 in his famous ’falling elevator’ 
argument. For Einstein, Equivalence was a more far-reaching concept than sim- 
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ply the Universality of Free Fall. His conjecture was that not just falling bodies 
but all the laws of physics respond in the same way to either a gravitational ac- 
celeration or an inertial acceleration. He made the EP the basis for his idea that 
gravity is a consequence of the geometry of spacetime. Equivalence is therefore 
both the historical foundation and the best-established fact of General Relativ- 
ity. Hence any breakdown of Universality of Free Fall would be in fundamental 
contradiction with Einstein’s theory. 

General Relativity provides the theoretical basis for our present description 
of the macroscopic world: the Big Bang, the cosmological expansion, the large- 
scale structure of the Universe, the end points of stellar evolution, gravitational 
collapse, neutron stars, black holes, gravitational waves, relativistic celestial me- 
chanics of the solar system, the high-precision description of the motion of nat- 
ural and artificial satellites, the definition of the international atomic time, etc. 
Nevertheless, General Relativity faces two obvious problems: it leaves the grav- 
itational interaction apparently disconnected from the other interactions in na- 
ture, and it resists all traditional attempts at quantization. Historically, numer- 
ous alternative theories of gravity, have been proposed, including the Kaluza- 
Klein-Jordan theory, Brans-Dicke theory, Fierz-Brans-Dicke theory, and others, 
all of which predict violations of Equivalence at some level. Modern theories, in- 
cluding the Standard Model and string theory, either have internal deficiencies 
which require new fields to resolve — leading to a violation of Equivalence — 
or predict new fields such as the dilaton of the order of the strength of gravity, 
with the same result. Damour and Polyakov [1] devised a mechanism in which 
the dilaton field is attracted toward values that nearly decouple from matter, 
resulting in the prediction of an EP violation in the range between 10“^^ and 
10“^^, much of which is accessible to STEP. 

Historically, there have been four important methods of testing Equivalence: 
(1) Galileo’s free fall method, (2) Newton’s pendulum method, (3) Newton’s 
celestial method (using observations either of the earth-moon system or of the 
moons of Jupiter in the sun’s field), (4) Eotvos’s torsion-balance method. Guri- 
ously, the pendulum test, originally the most precise, has been the least satisfac- 
tory. The best was H.H. Potter’s in 1926, who obtained an accuracy of a part in 
10®. Modern free fall experiments by groups from the US, Italy, and Japan give 
results between 1 and 3 parts in 10^®. Much the most accurate tests have been by 
torsion-balance experiments and the celestial method. The best torsion-balance 
results to date by Adelberger et al. have reached 2.4 parts in 10^^ for a variety 
of test bodies. The celestial method was refined by Laplace in 1798 and then 
dramatically reinvented in the 1960s in the context of lunar laser-ranging. Its 
current limit reported by Dickey et al. is 5 parts in 10^®. 

STEP returns to what is essentially Galileo’s free fall method, without the 
sudden stop at the end, rather than attempting to orbit a torsion balance. The 
reason for this choice is in the disturbing effect of gravity gradients on the ex- 
periment. Ever since the work of Roll, Krotkov, and Dicke (and to some extent 
since Eotvos) it has been recognized that the two dominant practical limitations 
on torsion-balance experiments are seismic disturbance and the effects of grav- 
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ity gradients on the mass moments of the balance. In STEP, the seismic effects 
are greatly reduced in comparison with ground-based experiments through the 
use of a drag-free satellite. The gravity gradient effects remain; with a torsion 
balance, they would overwhelm the experiment. Already in the ground-based 
tests the experimenters have had to summon up very considerable ingenuity to 
reduce gravity gradient disturbances. To advance the precision six more orders 
of magnitude another approach has to be found. 

The approach used in STEP depends on recognizing that gravity gradient 
effects enter the experiment in two ways, disturbances originating in the space- 
craft and disturbances originating in gradients from the earth’s gravity. In STEP 
it is possible to exploit the earth-gradient effects to our advantage. With two 
nearly concentric test masses in free fall, the earth-gradients produce a differ- 
ence in acceleration, doubly periodic in the orbit, from which the displacement 
between the mass centers can be determined. The masses can be centered on 
each other very accurately using this acceleration as an error signal. The differ- 
ential acceleration measurement is then insensitive to the earth’s gradient, and 
even more important, it is insensitive to first-order gradient disturbances from 
the spacecraft. One disturbing gravity gradient effect has been used to eliminate 
both. 

2 STEP Concepts 

The simplest concept of STEP is Galileo’s leaning tower experiment, with the 
difference that the test masses fall all the way around the Earth (Fig.l). If there 
is a difference in acceleration, the masses will go into separate orbits, and over 
short times they will separate in proportion to the square of the time of fall. 
For times of the order of one orbit or more, they tend to return periodically 
to their initial positions, with an added secular drift. The net gain is as if the 
experiment were done from a tower several thousand kilometers high, with the 
added advantages that the experiment repeats periodically and is performed in 
a relatively benign environment. 

A practical difficulty with a tower experiment is releasing the masses without 
disturbance, at exactly the same height and exactly the same instant. This is 
overcome in STEP by observing the difference in the masses’ motion. If the 
masses do not start from exactly the same initial conditions, they will likewise 
follow different orbits, but this difference is readily distinguishable from an EP 
violation by being at twice the frequency. This gravity gradient acceleration is 
used to precisely center the masses on one another, as described below. The 
STEP test masses are made in the shape of hollow cylinders so that one fits 
inside the other. This permits their centers of gravity to be precisely aligned, 
eliminating first order disturbances from gravity gradients which would result 
from miscentering. 

Ambiguities in the measurement would result if the masses were allowed to 
move freely in all degrees of freedom. In a practical experiment the masses are 
not completely free, but are constrained to move along a line, the sensitive direc- 
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Fig. 1. Concept of STEP 



tion, which coincides with the cylinder axis. Superconducting magnetic bearings 
with exceptional stability and extremely low losses provide the constraint mech- 
anism. The only constraint along the sensitive direction comes from the position 
measurement. This gives the masses periods of the order of 1000 seconds, slightly 
reducing the amplitude of motion expected from a violation. 

The position measurement is made by SQUID magnetometers which precisely 
measure currents from sensing coils near each test mass. The sensor makes a 
differential measurement between pairs of masses which is not much disturbed 
by satellite motion. Radial and angular vibrational modes of the masses are 
monitored by supplemental electrodes surrounding each test mass, which are 
also used to manipulate the masses by small applied voltages, and to measure 
their electrical charge. 

Air drag would seriously degrade the STEP measurement (as in Galileo’s 
experiment) if it were not compensated by a “drag-free” controller for the satel- 
lite. The motion of the satellite in response to drag is measured relative to the 
test masses, which are protected from the external drag by the satellite itself. A 
servomechanism fires small linear thrusters to correct the motion, cancelling the 
effect of the external drag or other disturbance. Reaction mass comes from boiloff 
of the liquid helium used to refrigerate the experiment, which would otherwise 
be yet another disturbance. 

In orbit most disturbances tend to recur at the frequency of the orbit and its 
harmonics. The characteristic signature of a violation occurs at orbit frequency 
unless the satellite is rotated. In a rotating satellite, the frequency of the violation 
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signal is at the difference frequency between the orbit and rotation frequency of 
the satellite. STEP uses this fact to separate any EP violation from most external 
disturbances. Some disturbances may track the signal because they also depend 
on this difference frequency. We have prepared a program of measurements which 
allows identification of these systematic disturbances by measuring at different 
rotation frequencies and with varied experimental conditions. Details of some of 
these measurement procedures are below. 

3 STEP Instrument Configuration 

The STEP instrument comprises four differential accelerometers, which will be 
operated simultaneously to maximize the quantity and quality of the data re- 
turned. These accelerometers operate at a nominal temperature of approximately 
2 K. Each accelerometer contains two cylindrical test masses which are con- 
strained from moving radially by superconducting magnetic bearings that leave 
them free to move along the axis of the instrument. Motion along the cylinder 
axis is measured precisely by SQUID magnetometers, and a capacitance pickoff 
measures motion in all degrees of freedom at lower resolution. Magnetic force 
from the SQUID sensing coils constrains the masses in the axial direction. The 
SQUID outputs measure both differential mode acceleration and common mode 
acceleration along the cylinder axis. The masses can be manipulated by voltages 
applied to capacitance sensor electrodes surrounding them. The charge on the 
masses can be estimated from the response of the masses to these voltages, and 
it can be controlled by an ultraviolet discharging mechanism. The masses will 
be caged during launch by a hydraulic actuator, similar to the GP-B caging 
mechanism, using pressurized liquid helium as the working fluid. 

Each differential accelerometer (Fig. 2) contains two concentric test masses 
with cylindrical symmetry, each coated with a thin film of superconductor, which 
are independently constrained by superconducting magnetic bearings and which 
move freely along the symmetry axis. The “belted-cylinder” shapes of the test 
masses are picked to minimize higher order gravitational coupling to possibly 
moving masses in the satellite; for example each mass has the same moment of 
inertia about any axis. This makes the masses respond to gravity like a sphere 
to sixth order. 

The SQUID displacement sensors will be able to detect displacements corre- 
sponding to about 3 X 10“^® g of differential acceleration in an averaging time 
of about 20 orbits. SQUIDs which measure the common mode will be sensitive 
to about 1 X 10“^® g over the same time span. Additional details of the SQUIDs 
are below. 

The test masses must be prevented from moving in other than the sensi- 
tive direction to avoid ambiguity in the acceleration being measured; the sense 
coils necessarily have some response to motion in other modes than along the 
cylinder axis. Superconducting magnetic bearings perform this function. A me- 
ander pattern niobium circuit on a quartz cylinder provides the maximum radial 
constraint of each test mass with minimum force along the sensitive axis. This 
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Fig. 2. STEP Accelerometer 



type of circuit produces stiff radical force on the test mass with no net magnetic 
moment to disturb the SQUID. These circuits are divided in quadrants to allow 
adjustment of the test masses’ center of mass positions during setup. 

The accelerometer configuration (Fig. 3) is best described by reference to 
spacecraft coordinates which have X and Y axes in the plane of the orbit and 
the Z axis roughly toward the Sun. The accelerometers have their sensitive axes 
alternately along the X and Y directions, and are stacked along the Z axis. They 
will be separated by about 15 cm in the Z direction. A quartz block supports 
and aligns the accelerometers and associated parts in this configuration. This 
configuration allows all the accelerometers to be centered on the roll axis of the 
satellite which is normally horizontal and perpendicular to the in-track direction. 

In this attitude (Fig. 4) the accelerometers can be operated simultaneously 
with minimum disturbance from the Earth’s gravity gradient. Attitude errors 
will be estimated by using the common mode of the accelerometers to measure 
the Earth’s gradient, and used to orient the satellite. This ensures that any 
disturbance from the Earth’s gradient will be minimized. 

The STEP accelerometers are contained in a dewar, part of a “drag-free” 
satellite (Fig. 5) which completely surrounds the entire instrument, protecting 
it from disturbances such as air drag, magnetic field, and solar pressure. The 
satellite will follow the masses by using linear thrusters which compensate for 
the drag and cause the satellite to precisely follow the masses. The satellite will 
include a radiation sensor to improve the estimate of the masses’ charging rate, 
an external magnetic field sensor, electronics to control the payload, a superfiuid 
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Fig. 3. Quartz Block Concept 



helium dewar incorporating a superconducting magnetic shield and ultrahigh 
vacuum chamber that surrounds the instrument, communications equipment, 
power supply, computer, and other support equipment. 

The essential requirement of the design is to eliminate all disturbances that 
could imitate the signal of a difference in rate of fall. The inherent stability and 
low losses in superconductors reduce the intrinsic sensor noise. Superconducting 
shielding eliminates external magnetic and electrical disturbances. Likewise the 
very low temperature (~ 2 K), temperature stability and uniformity (< 0.5 
mK per orbit), and pressure (< 10“^^ torr) reduce most thermal disturbances 
to insignificant levels. The shapes of the test masses are chosen to minimize 
disturbing gravity forces from the satellite, and the masses will be centered 
within 10“^° m to remove the effect of the Earth’s gravity gradient. A charge 
control system reduces disturbances from particle radiation and the resultant 
electrical charging. 

Active drag compensation ( “Drag Free Control” ) is required to reduce satel- 
lite acceleration by seven orders of magnitude within the measurement band- 
width. The helium gas boiled off from the cryostat, which is otherwise a distur- 
bance, is used as the propellant. This gas is vented through continuously variable 
thrusters developed for the GP-B program. Temperature control of the dewar is 
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Fig. 4. STEP Satellite Concept 



also maintained by the drag free system which will “null-dump” excess helium 
for cooling without changing the net thrust. A sun-synchronous orbit, about 500 
km high, minimizes thermal changes to the satellite. 

To distinguish a EP violation from most remaining disturbances, the satel- 
lite must rotate about an axis (the spacecraft Z-axis) normal to a plane parallel 
to the sensitive directions (or cylinder axis) of the accelerometers (Fig. 5). The 
rotation axis must be horizontal to prevent the accelerometers being disturbed 
by common-mode gravity gradient accelerations from the Earth. The EP sig- 
nal appears at the difference in frequency between the orbit frequency and the 
rotation frequency. The rotation frequency will be changed between measure- 
ments to move the signal frequency away from fixed frequency disturbances. 
The satellite’s rotation axis and attitude will be chosen and controlled to mini- 
mize disturbances from the Earth’s gravity gradients, by observation of the test 
masses. Experimental parameters, for example the electric charge on the test 
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Fig. 5. STEP Orientation and Coordinate Systems 



mass, or the center of mass displacement, can be varied to distinguish a possible 
violation from systematic disturbances. 



Test Mass Choice 

Extensive discussions have led to a baseline design choice of three materials, 
Pt-Ir, Nb, Be, in a cyclic condition, with the Pt-Ir/Be pair duplicated in an ac- 
celerometer with somewhat different design. This choice presumes that sensitive 
comparisons of two materials will be possible only within a single differential 
accelerometer, and that materials in different accelerometers cannot be directly 
compared. The choice of test mass materials was made to maximize information 
return. There is a tradeoff between two competing ideas, first to test as many 
different materials as possible, so as not to miss anything, and second to make 
certain that anything measured is a real effect. The choice is complicated by 
issues of manufacturability, that is, there are only a limited number of scien- 
tifically interesting materials which can be made into a test mass. Uncertainty 
about what might cause an EP violation suggests testing a multiplicity of mate- 
rials. From this viewpoint a number of materials should be chosen from the low, 
middle and high portions of the periodic table. Though theoretical ideas are at 
present barely suggestive, it is expected that those properties which might lead 
to an EP violation will vary more or less smoothly across the periodic table; for 
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example mass deficit or neutron number. It is probably not too critical which 
ones are chosen, so long as they have linearly independent properties. 

Testing whether a measured result is real or spurious involves both repetition 
of the experiment (to test whether the result is accidental) and variation of ex- 
perimental conditions (to test that the result is not a systematic effect) . Ideally 
one would like to test the same materials under many different conditions. In 
that case, if the result is unchanged, it is clear that the differing conditions have 
nothing to do with the outcome. This suggests testing a minimum number of 
materials very carefully. An important verification technique to use with mul- 
tiple materials is the cyclic condition, in which the total acceleration difference 
between three pairs of masses AB, BC, CA must add to zero. The baseline choice 
shown in Table 1 incorporates a cyclic condition and the duplication of a pair of 
materials with a different accelerometer design. This baseline test-mass scheme 
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incorporates two model-independent validity checks: 1) ab, be, ac; and 2) ab, 
(ab)’. The prime indicates that this accelerometer will include certain variations 
in experimental parameters as a further check for systematic effects. For exam- 
ple the ab and (ab)’ accelerometers will be orientated at 90° relative to each 
other and will be physically separated by the length of the quartz block support 
structure. Radial and longitudinal spring constants will be made non-identical 
by setting bearing and readout circuit currents. Similarly, electrostatic position 
sensing voltages and electrode gap spacings will also differ. 

We have verified through discussions with Speedring Inc. and BIPM that 
the three chosen materials can be machined to sub-micron tolerances. Their 
cryogenic properties are known sufficiently well to enable mechanical design and 
thermal analysis to proceed. We have conducted preliminary studies of their Nb 
thin film coating properties and foresee no obstacles in this regard. 



3.1 Test Mass Shapes and Configuration 

To ensure that non-uniform gravitational fields do not cause differences in accel- 
eration of the test masses, their centers of mass must coincide, and their shapes 
and dimensions must be chosen so as to minimize the coupling of higher order 
gravitational gradients to each mass. This approach greatly reduces spurious 
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gravitational effects due to helium tides in spacecraft’s dewar, deformation of 
the spacecraft, etc. An ideal shape for the masses would be spherical, but this 
gives no good way to measure the inner mass, and although spheres can be very 
accurately made, any gravitational advantage is most likely nullified by density 
variations. The STEP test masses will be belted cylinders with their dimensions 
chosen to make the masses act as much like spheres as possible gravitationally. 
Where it is not possible to give the masses zero multipole moments, correspond- 
ing moments of inner and outer masses will be made identical. The shapes of the 
2 masses in each pair separately make the quadrupole, hexadecapole, and 64- 
pole terms close to zero, while minimising the 256-pole gravitational coupling 
term [4] . Therefore the masses ‘look like’ gravitational monopoles to a few ppm 
for gravitational sources as close as 250 mm away [5]. 

3.2 Differential Accelerometer Operation 

The differential accelerometers each contain a pair of test masses constrained by 
magnetic springs. The differential displacement along their common ’sensitive’ 
axis is measured to very high precision (Ri 7 x 10“^^ m in 10^ s integration 
time) using superconducting pick-up coils connected to SQUID detectors. The 
differential displacement is therefore a measure of the difference in acceleration 
experienced by the masses in each pair. Each mass is guided along the sensitive 
axis by a superconducting magnetic bearing. Its equilibrium position may be 
trimmed by adjustment of supercurrents in the bearings and SQUID coils, or by 
a voltage applied to surrounding electrodes. 

SQUIDs (Superconducting Quantum Interference Devices) are chosen for 
STEP because of their extremely high sensitivity and low intrinsic noise (ap- 
prox. 2.6 X 10“^® J/Hz at the signal frequency). An ordinary non-differential 
accelerometer is made by placing one test mass between two superconducting 
pickup coils wired in series, with a persistent supercurrent flowing around the cir- 
cuit (Fig. 6). This circuit is the superconducting equivalent of a common device, 
the linear variable displacement transformer (LVDT). 
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Movement of the superconductor-coated test mass towards one pickup coil, 
and away from the other one, changes the inductance values. Because of flux 
conservation, this forces a current through a third coil in parallel with the first 
two. This signal current is roughly proportional to the displacement, and the 
third coil is the input coil of a SQUID. STEP’S differential accelerometers use 
two such circuits — one for each test mass — connected back-to-back, so that 
they share the same SQUID input coil (Fig. 7). With proper setup currents, the 
SQUID will sense a current proportional to the difference in displacement of the 
masses (and hence to their differential acceleration). 

The accelerometers comprising each test mass pair must be balanced against 
each other to give a true differential signal. This is done by adjusting the magni- 
tude of their respective circulating supercurrents. A supercurrent can be changed 
by destroying the superconductivity of part of the circuit by a heater and then 
restoring it after the current is changed, or indirectly by coupling to a transformer 
which incorporates a superconducting loop as its primary. These are standard 
cryogenic techniques which can be performed with high precision. The satel- 
lite will be accelerated to provide a dither acceleration, and the supercurrents 
adjusted to minimize the resulting differential mode response. The differential 
accelerometers are designed to have have excellent rejection of common-mode 
accelerations combined with exceptionally high sensitivity to differential accel- 
erations. Moreover, they will remain balanced due to the unparalleled stability 
of the persistent supercurrents. 

As previously mentioned, the test masses must be prevented from moving in 
other than the sensitive direction to avoid ambiguity in the acceleration being 
measured. The superconducting bearing circuits which constrain the masses are 
divided in quadrants to allow adjustment of the center of mass positions. This 
setup adjustment prevents coupling of first order gravity gradient forces into the 
measurement. 

The center of mass offset is measured using the gravity gradient disturbance 
itself as an error signal, and the bearings are actively adjusted until the distur- 
bance vanishes. The extremely stable supercurrent used for the adjustment is 
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“frozen” at the end of this setup procedure and then needs no further adjust- 
ment. The controller for this adjustment will be integrated with the drag-free 
controller for the satellite. The residual disturbance from drag-free control lim- 
its the measurement accuracy of the centering to perhaps 3 x 10“^^ m, and will 
cause the masses to vibrate with an amplitude about 1.3 x 10“^° m, averaged 
over one signal period. This amplitude is nearly eight times smaller than the re- 
quirement of 10“® m, and is at the ^ 0.01 Hz resonant frequency of the bearing, 
well removed from the signal frequency. 

It will be necessary to control the masses’ motion both before and after 
the magnetic bearing setup procedure. This manipulation will be done by an 
electrostatic positioning system. Several electrodes will be arranged around each 
test mass, and control voltages applied depending on its position and velocity. 
The positioning system will also capacitively measure the position of each test 
mass in six degrees of freedom, plus a charge measurement. During initialization 
procedures the electrostatic system controls the position of the mass (which must 
be fixed for SQUID setup) and provides positioning information to the drag free 
controller in those modes that the SQUID does not measure. During instrument 
operation the electrostatic system provides two additional functions. It provides 
damping of the motion of the test masses, thereby reducing intermode coupling. 
It also performs charge estimation and control, to reduce disturbances from 
electric charging during SAA passages. The charge measurement is conceptually 
made by applying a dither voltage to which the mass responds only if it is 
charged. This measurement is then used to drive the charge to zero in an active 
control loop. A UV light source, derived from GP-B heritage, allows charge to 
be moved on or off the mass without contacting it. 

During launch the test masses will be held securely by a specially designed 
caging mechanism which will disengage before measurements begin, but can be 
re-engaged at any time should the masses need to be clamped for any reason. 

4 Experiment Operations and Timeline 

We have produced a detailed description of the operations sequence and schedule 
for the experiment, which is summarized below and in Fig. 8. Beyond the setup 
and calibration stage of the experiment, a completely predetermined schedule of 
measurement operations for STEP is not possible. A flexible operations strategy 
is necessary to meet contingencies. This is because the optimum schedule, and 
which operations are to be performed, depends on the conditions actually found 
in orbit. These conditions cannot be known in advance because they depend 
on things such as the degree to which requirements have been met, environ- 
mental factors which cannot be predicted (or have never been measured), and 
possibly overlooked error sources. There is not time enough in the mission to 
perform all possible measurements, so a selection is necessary based on the con- 
ditions actually discovered in orbit. The strategy is therefore to prepare a set of 
measurements to be performed when predetermined conditions are met. Once 
preliminary data on the actual conditions are available, these “canned” proce- 
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dures will be used as needed to identify and calibrate possible systematic effects 
and optimize the system. 

The rough outline of the experiment is therefore as follows. After an initial 
period of satellite commissioning, the drag free system is initialized in a “coarse 
drag-free” mode using capacitance sensor measurements for the reference. The 
differential accelerometers are initialized and calibrated. Really precise calibra- 
tion is not generally required for STEP; the primary calibration reference is the 
capacitance sensor, which depends on the dimensional stability of the electrodes 
for its accuracy. This and the SQUID sensor will be precalibrated to < 1% on 
the ground. The accelerometers will be launched in a shutdown condition; the 
SQUIDs will be set up and recalibrated in orbit by reference to the capacitance 
sensor. The setups which need to be precise are the common mode rejection ad- 
justment and the center of mass adjustment. These adjust internal ratios rather 
than absolute values and are self referenced by nulling or minimizing an output. 

The common mode rejection adjustment changes the scale factors for each 
mass in a given differential accelerometer. The satellite will be accelerated at 
a known low frequency (much less than the longitudinal frequency of the test 
masses) to provide a dither signal in the sensitive direction. The supercurrents 
will be adjusted to produce a null response to this acceleration in the differential 
mode. The supercurrents can be changed in a few seconds, but the time required 
for the adjustment will need to be at least several times the test mass period 
(~1000 seconds) to allow the system to settle. All the differential accelerometers 
can be adjusted simultaneously, so this adjustment can be completed and checked 
in a few (2-5) days. Following the differential mode adjustment the satellite will 
be accelerated at right angles to the sensitive direction, to calibrate the coupling 
due to accelerometer misalignment. If this coupling is known to a few percent 
or better, it can be used in a software correction to improve the overall common 
mode rejection. This completes the adjustment and calibration of the common 
mode rejection ratio. 

The procedure for centering the test masses on one another was described 
briefly above. In summary the gravity gradient acceleration in the differential 
mode is used as an error signal for the center of mass displacement. The center of 
mass displacement is adjusted by changes to supercurrents, and once the masses’ 
equilibrium positions are centered on each other, they should remain centered. 
The masses are damped in the radial modes by the electrostatic positioning 
system, and are excited only by residual spacecraft motion, so that the masses 
never move very far from their equilibrium positions. No on-orbit calibration is 
necessary since both error signal and displacement go to zero together. 

An initial measurement is made with the satellite inertially oriented. This is 
a reference measurement which can be taken immediately after setup. The signal 
frequency is equal to the orbit frequency which contains many disturbances that 
may mask an EP signal. The information is primarily for characterization of the 
disturbances, so that it will be possible to make a rational choice of the roll rate 
for subsequent measurements. 
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Fig. 8. STEP Mission Timeline 



The EP signal appears at the difference frequency between the roll and orbit 
frequencies. Its phase is such that zero amplitude occurs when an accelerometer’s 
sensitive axis is horizontal. Its amplitude (measured as an acceleration) is inde- 
pendent of roll rate. For an EP measurement, data will be taken for periods of 
20,000 to 500,000 seconds, and at roll rates from —3 to -1-3 times orbit frequency 

1.8 X 10“^Hz), or at signal frequency from —2 to -1-4 times orbit rate. EP mea- 
surements will be repeated with different roll rates and phases throughout the 
mission, at rates which avoid the frequencies of large disturbances. These tests 
will be intermixed with tests for systematic disturbances, which are discussed in 
the next section. 

When the remaining helium is down to about ten day’s supply, the experi- 
ment will enter a post-measurement phase which remeasures some of the items 
measured during calibration to see if they have changed, and adds a few tests 
which could not be done earlier because they might permanently change the 
setup. In the terminal phase, after the helium is gone, the satellite loses drag- 
free and attitude control and rapidly warms to a temperature where no further 
tests are possible. 

We estimate about a year will be necessary to completely analyze the data. 
The analysis will be performed by two teams of scientists in Europe and America. 
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4.1 Test for Systematic Effects 

Whether or not there is an EP signal it will be necessary to make tests that 
give confidence in the results. There are two possibilities for judging whether an 
apparent EP signal may be due to a disturbance, first by attempting to make 
the disturbance bigger, so that its effect becomes clear, or second, if this is not 
possible, measuring the disturbance and correlating it with the measurement 
output. In some cases the effect of a disturbance may be defined well enough 
to allow it to be calibrated out of the data, possibly giving an improvement in 
performance. 

We can design tests for disturbances that can be anticipated, such as tem- 
perature change. This includes most if not all of the disturbances in the error 
analysis. The plan is twofold: we will have a prepared list of tests to be performed 
if given conditions exist, and in any case we will perform as many different tests 
as the schedule allows. Decisions on which tests to perform must be made in real 
time, picking from the prepared lists. In all cases as many experiment variables 
as possible should be measured simultaneously. 

What follows is a selection from the list of tests we can perform. Except 
in case of the bearing force determinations (which map background forces) the 
tests are performed like an EP measurement, but with variation of one or more 
of the indicated parameters. That is, the indicated parameters are changed but 
the rest of the procedure is identical to an EP measurement. Note that many of 
these test for an entire class of disturbance, rather than any specific one. 



Bearing Force Determination 

The force from the magnetic bearings and environment can produce disturbances 
by varying in time, and by strong nonlinearities converting the drag free resid- 
ual into the measurement bandwidth. The force is measured by balancing the 
unknown forces from the bearings and environment against the electrostatic po- 
sitioning system. Magnetic and electric forces can be separated to a large extent 
by varying the bearing setup currents and charge on the mass. The purpose of 
this measurement is to verify that the forces on the test masses meet require- 
ments, to measure how much of the total force is magnetic or electric, and to see 
if the forces have changed. This measurement should be made at least once at 
the beginning of the mission and again at the end. It should be repeated if there 
is any evidence of change, for example if the masses’ resonant periods change for 
some reason. 



Gravity Gradient Sensitivity Measurements 

Gravity gradients are a source of concern because they are potentially large rela- 
tive to the effect sought, and cannot be shielded. They are measured by offsetting 
the mass centers of the masses or by offsetting the common mass center and cor- 
relating the resulting acceleration with environmental factors. Also, comparisons 
between accelerometers will provide very sensitive measures of both local and 
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Earth gradients because of the long fixed baseline. These measurements test sen- 
sitivity of the experiment to changes in satellite and/or earth gravity gradients. 
They should be performed if there is an irreducible, significant second harmonic 
component to the differential acceleration, or if correlations between the common 
modes of separate accelerometers suggest that the satellite is changing shape. 

Electric Charge Sensitivity Measurements 

These measurements test that electrical forces are within requirements, how 
much of the total force is electrical, measure how much the charge contributes 
to the noise, and how much disturbance could be caused by the charging effect 
of particle radiation. For these tests the charge on the mass is controlled to a 
relatively large value (>0.5 volts) and the effect on the acceleration measurement 
measured. They should be performed if there is any correlation between the 
differential signals and the measured residual charge or radiation dose. 



Electrical Potential Sensitivity Measurements 

The voltages on the sense electrodes for the electrostatic measurement and 
charge control system are potentially a source of disturbance, although the elec- 
trodes and measurement system are designed to minimize this. The disturbance 
results from both the magnitude of the voltages and voltage noise present on 
them. These tests increase the voltages and vary them while measuring the re- 
sulting accelerations. These measurements should be performed at least once 
during the mission. 



Temperature Variation 

Classically, changes in temperature are a major cause of disturbances. The in- 
strument and satellite contain heaters for applying thermal gradients and chang- 
ing local and global temperatures. These measurements test the coupling of 
temperature to the experiment through the mechanisms of gas pressure effects, 
penetration depth changes, thermal expansion, electronic drift, etc. It may not 
be possible in all cases to get a clean separation of the different coupling mecha- 
nisms. These measurements should be made at least once during the mission, or 
if there is any correlation between measured temperatures and the differential 
acceleration. 



Test Mass Temperature 

The masses are thermally isolated from their housings and heated by particle 
radiation, so that their temperature may rise a few Kelvin during the mission 
duration. We have not so far found any disturbances caused by a uniform tem- 
perature elevation of the test masses; the known disturbances are driven by ther- 
mal gradients at or near the signal frequency. The particle radiation is highly 
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penetrating and heats the masses uniformly. The masses will have very short 
thermal relaxation times of a few tens of milliseconds or less, so any thermal 
gradients caused by this small uniform heat input will be both insignificant and 
transient. Because the temperature of the masses is not directly monitored, if 
a disturbance from this source is suspected, the masses can be recaged and the 
temperature rise of the caging mechanism measured to estimate the temperature 
of the masses. A direct noncontacting measurement of the mass temperature, 
although of interest, is expected to be very expensive. 



Satellite Motion 

Motion of the satellite relative to the test masses is a major determinant of 
the performance of the experiment, since it directly determines the ultimate 
limit to all displacement measurements and directly affects many subsystems. 
Indirect effects include gravitational coupling of the satellite to the differential 
measurement. Two types of satellite motion disturbance may be distinguished. 
The most important motion disturbance is at low frequencies (< 2 Hz), which 
is due to the error in response of the drag free system to variations in residual 
drag. Higher frequency acoustic noise might be generated by overlooked internal 
sources (monitored by a microphone) or micrometeor impacts. 

Sensitivity to satellite vibration can be directly checked by applying a known 
vibration with a PZT or other acoustic source. Measurements testing the masses’ 
coupling to satellite motion will be performed at least once at the beginning of 
the mission during the common mode rejection setup. They will be repeated 
after SQUID setup changes, or if there is any correlation of the differential mode 
with the drag free error signal, or after changes in satellite rotation if the res- 
onant frequencies of the masses differ significantly (in which case changes in 
rotation affect their equilibrium positions). These measurements are made by 
accelerating the satellite in a known way and measuring the resulting output 
from the accelerometers. For example, the quality of common mode rejection 
will be measured by accelerating the satellite at a frequency near the signal fre- 
quency. The common mode will then be readjusted if required, by changing the 
setup currents in the position sensors and magnetic bearings. 

Other important tests related to satellite motion include switching to dif- 
ferent drag free reference masses, applying bias accelerations, and changing the 
form of the control law. These will measure the sensitivity of the experiment to 
assumptions about the drag free control and its interaction with the environ- 
ment. 



Magnetic Shielding 

External magnetic fields can penetrate into the instrument through flaws in the 
superconducting shield. This directly affects the SQUID position sensor mea- 
surement, adding a fictitious displacement to the real one. Large field leakage 
can also apply forces to the test masses. We will test the quality of magnetic 
shielding at least once during the mission, or if there is any correlation with the 
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external magnetometer output. The external field can be changed by exercising 
the magnetorquers, or the SQUID outputs can be correlated with the measured 
external field. 



Particle Radiation 

Particle radiation has three effects: charging the masses, heating them, and push- 
ing them around by momentum transfer. 

Measurements of the charge are planned continuously during the mission; 
these measurements test the integrity and reliability of the charge estimate from 
the measured dose. Charge measurements will be performed automatically as 
part of the standard data set, and will be automatically compared with the 
output of a radiation sensor to estimate the charge delivered by a given radia- 
tion dose. This estimate will be updated continuously in a convergent feedback 
loop. The effectiveness of this procedure for determining sensitivity to charging 
from particle radiation can be measured by passing through the South Atlantic 
Anomaly (SAA) in different orientations, which changes the shielding factor. 
Note that the charge on the test masses will be systematically and deliberately 
varied as part of the electric charge sensitivity measurements. 

Particle radiation is highly penetrating and heats the masses uniformly. They 
are highly conducting and have a significant thermal mass relative to the small 
power in this source. A temperature rise of the order of 1 mK for each pass 
through the SAA is expected. The effect of this was discussed in the paragraph 
on test mass temperature. 

Particles actually stopping in the test masses will transfer their momentum to 
them, and since the proton flux may be expected to be predominantly downward, 
there will be a net downward acceleration during SAA passage of the order of 
10“^® m/s^. This small disturbance will be correlated with the proton ffux, 
and not with the signal which is at a different frequency. The corresponding 
disturbance from cosmic rays is at signal frequency, but is about a thousand 
times smaller. 



Optical and Thermal Radiation 

A UV light is used for charge control, and the momentum and heating of the 
light can disturb the measurement. The sensitivity of the instrument to UV 
light stability and power will be measured at least once near the beginning of 
the mission. 



Helinm Tide 

The EP experiment is extremely sensitive to disturbance from gravity gradients 
from moving masses. The largest potentially movable mass in the satellite is the 
superffuid liquid helium used for refrigeration. If free to move, its surface would 
be excited by gravity gradients which have a harmonic relationship to the signal 
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frequency, resulting in a static distortion of the order of a centimeter, and possi- 
bly dynamic (wave) distortion as well. The helium is planned to be constrained 
by trapping it in a foam or aerogel. This restricts the motion by surface forces 
(or fountain pressure) much larger than the gravity gradients (See the results 
described in Section 5) . The remaining motion will cause negligible disturbance. 
Constrained or not, it is important to be able to confirm that the helium is not 
disturbing the accelerometers. These measurements test the sensitivity of the 
instrument to motion of the liquid helium. 

The amplitude of the disturbance from helium tide is expected to vary 
strongly depending on the amount of helium in the dewar and its distribu- 
tion. The signature of a helium tide is expected to include significant fourth and 
higher harmonics of the signal frequency, and those harmonics will vary system- 
atically in amplitude and phase relative to the signal frequency. This would be 
accompanied by a difference in common mode acceleration between accelerom- 
eters indicating a mass shift on board the satellite that doesn’t correlate with 
satellite temperature. 

Several sorts of tests can be performed. The first sort of test is to change the 
satellite rotation frequency. Because the helium tide has a (largely unknown) 
time-dependent response to Earth’s gravity gradient, this will in general in- 
troduce a measureable phase and amplitude shift in the data. This response 
will vary systematically with helium level, and change slowly during the mea- 
surement. The next sort of tests would compare the common mode outputs of 
different accelerometers. The comparison would be highly sensitive to motion of 
nearby masses including helium tide. Third, introducing a static displacement 
between the masses in each accelerometer will enhance the test, possibly allow- 
ing some localization of the disturbance by measuring higher gradients. Finally, 
tests which apply heat or thermal gradients to the helium bath may change 
the distribution of helium within the constraining foam, by fountain effect, and 
therefore will change the helium tide response. 



Mass Dynamics 

Similar to the case of satellite motion, movement of the test masses relative 
to their housings can cause various disturbances. These disturbances include 
calibration changes, real and apparent coupling between modes of motion, and 
saturation of measurement systems. The masses’ motion will normally be elec- 
tronically damped to very low amplitudes in all degrees of freedom, using the 
electrostatic positioning system. A particular concern is rotation about the cylin- 
der axis, which might be difficult to measure and control. We have shown that 
test mass rotation can be sensed and controlled using the residual out of round- 
ness of the cylinders (~ 1 /rm). 

Measurements involving mass dynamics test the accelerometers’ sensitivity 
to motion in modes other than the sensitive axis, including cross-coupling and 
frequency conversion. The masses will be displaced and excited in different modes 
by the electrostatic positioning system and magnetic bearings, while observing 
the accelerometer outputs. 
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5 Error Analysis and Mission Tradeoffs 

STEP is necessarily a highly optimized experiment with many tradeoffs, and 
sections cannot simply be split off for separate analysis without the risk of in- 
consistency. Most error sources were evaluated analytically for the purpose of 
setting requirements in the original STEP Science Requirements Document (May 
1990) and the analysis has been continually updated to the present day in an 
increasingly systematic and consistent way. 

Early error calculations, sometimes performed by independent workers, tend- 
ed to use independent assumptions for each error source resulting in requirements 
which were not entirely self-consistent. To eliminate these inconsistencies in as- 
sumptions and requirements we have developed a computer program which au- 
tomatically calculates the errors and resulting tradeoffs from consistent starting 
assumptions. 

The starting point for all the computer programs is the STEP Error Anal- 
ysis, a text document actively maintained in our database. The STEP Error 
Analysis includes estimates of specific disturbances to the masses in the gen- 
eral categories of thermal noise, gas pressure forces, electrical forces, magnetic 
forces, gravitational forces, radiation pressure, and vibration. It also includes 
estimates of disturbances to the measurement system, including measurement 
noise, changes to the superconductors, and thermal and mechanical stability. 
Specific disturbances considered include electric potentials in the housing sur- 
rounding the mass (including patch effect), the radiometer effect, losses from 
eddy current damping, gravitational coupling from helium tide, drag free resid- 
ual vibration coupled to the differential mode, SQUID noise, penetration depth 
changes in superconductors, and momentum transfer from penetrating particle 
radiation. The Error Analysis also lists interactions and tradeoffs within the 
experiment. 

We also maintain an error analysis program, broadly based on the STEP 
Error Analysis. The goal is to embody a model of the experiment and appa- 
ratus (especially including tradeoffs) as a self-consistent whole. The extensive 
model of the accelerometers automatically calculates the common mode balance 
procedure, and the drag free and charge control laws are physically realizable 
models which will be updated with the actual control laws when those are fi- 
nalized. The program systematically evaluates the known errors starting from 
a relatively small and consistent set of low level assumptions, and because it 
efficiently and explicitly incorporates the major tradeoffs it can be used to set 
requirements. Many if not most experiment parameters are explicitly dependent 
on other parameters or requirements; some simple examples are the mass of a 
test body from its dimensions and material, or the signal frequency from orbit 
height and satellite spin rate, or less directly the common mode rejection ratio 
from the setup currents and coil dimensions in the SQUID position sensor. The 
program automatically calculates these, preventing much of the inconsistency 
that results from independent assumptions. Two examples of the utility of this 
approach are given below, for the effects of orbit height and satellite rotation on 
the experiment. 
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5.1 Orbit Height Effects in the Model 

Figs. 9 and 10 show sample results for two different orbit heights, 500 and 400 
km respectively, with no other changes. Fig. 9 is close to the STEP baseline mis- 
sion. In this example the performance is dominated by two noise noise sources, 
SQUID noise and Nyquist noise. SQUID noise is the inherent flux noise of the 
SQUID sensor, converted into an equivalent acceleration. Nyquist noise is the 
fundamental thermal noise limit, likewise converted. 

Some differences due to orbit height are readily apparent, especially in the 
disturbance “dynamic CM offset” disturbance which is greatly increased at the 
lower altitude relative to the other disturbances. Vibrations of the test masses 
about their equilibrium positions result in changing center of mass offsets that 
can couple gravity gradients into the signal frequency. These vibrations are 
caused by residual satellite motion. The drag free residual sensed in the dif- 
ferential mode of the accelerometers {drag free residual in diff. mode) is a direct 
limitation to the mass centering performance, and this residual is increased by 
the overall increase in drag in the lower orbit. Consequently the centering system 
cannot work so well in a lower orbit, and the disturbance increases rapidly with 
the centering error. There is a similar cause for the much smaller increase in 
the disturbance from electrical charge. The charge control system’s performance 
{Electric charge) is also determined by the overall drag free residual, so that the 
charge cannot be controlled so well in a low orbit. This cancels the expectation 
that the charging problem would be less because the radiation decreases in lower 
orbits. The smallest charge that can be measured depends on the residual accel- 
eration noise from drag, and the charge cannot be controlled better than it can 
be measured. In the model used for the radiation flux and its interaction with 
the test masses, the increase in drag noise is proportionally somewhat larger 
than the decrease in the disturbing flux, within the range of likely altitudes for 
STEP. 

5.2 Satellite Rotation and EP Signal Detection in the Error Model 

The ability to shift the signal frequency away from the frequency of disturbances 
is extremely important. Many disturbances, for example that due to particle 
radiation, occur at the frequency of the orbit or its harmonics. Moving the signal 
away from these frequencies may result in a significant improvement in signal to 
noise. 

The disturbances of most concern occur at the signal frequency, the differ- 
ence between orbit rate and rotation rate. For these disturbances, the rotation 
rate cannot have a direct effect because the disturbance tracks it. It may hap- 
pen that indirect effects of rotation are important. One such case is when the 
centrifugal force of rotation is comparable to the spring constant restraining the 
test masses. Centrifugal force is a direct effect of rotation, and adds directly 
to the spring constants. However, the spring constants are determined by the 
set-up conditions, which tend to cancel the direct effects of the rotation. It is 
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DFC Reference accelerometer Systematic component at signal frequemy 

Disturbance m/sec^2 Comment 



SQUID noise 
SQUID temp, drift 
ihermai expansion 
Dfferential Thermal expansion 
Nyquist Noise 
Gas StreaminQ 
Radiometer Effect 
Thermal radiation on mass 
Var. Discharge ur light 
Carth field leakage to GQUID 
Earth Field force 
Penetration depth change 
Electric Charge 
Electric Potenfai 
Sense voltage of^et 
Dragfree residual in dff. Mode 
Viscous coupling 
Cosmic ray momentum 
Proton radiation momentum 
dynamic CM offset 
static CM orrset imit 
Trapped flue drfl acceleration 
Trapped flue changes in squid 
S/C gradient + CM offset 
rotation stability 

EtcsnIriciV subharm @si7ial troq. 




acceleration equivalerri to intrinsic noise 

regulation of SQUID carriers 

gradient along LAC structure 

Radial gradent in DAC structure 

RMS acceleration equivalent 

decayinq Gas flow, outqassinq 

gradient along DAC structure 

Radiation prossure, gradient 

unstable sou'ce, opposite angles on masses 

estimate for signal frequency component 

estimate for signal frequency component 

longitudinal gradient 

Assumptions about rate 

variations h measurement voltage 

bias offset 

estimated from squid noise 

gas drag *■ damping 

mostly directed downvvard 

^bidirectional, downward 

vibration about setpoint, converted 

A/D saturation by 2nd harmonic gg 

actual force fromintemal field stabiliV 

apparent motion from internal field stabiliV 

cavity gradient coupling to DFC residual of S/C 

centrfugal force variation + offset from axis 

incluctes bandwidth imitations 



position sensor gap, mm 1.00 

common mode period 1474 

differential mode period 1135 

S/C rotation per orbit -2.72E+00 

Total error 9.74E-18 



500000 Orbit height 
a0086 sensor current. A 
2.6E-10 CM distance, m 

RMS error 3.36E-18 rrfMC*2 



Fig. 9. Typical Errors for a 500 km orbit. Noise sources known to be incoherent, e.g. 
thermal noise, are converted to an expectation value at the signal frequency by RMS 
averaging. Coherent errors (e.g. some temperature driven effects) are calculated at 
signal frequency. A worst case estimate is made by simply summing the errors. For 
a statistical estimate, the errors are combined in an RMS sense. The bar chart is 
automatically rescaled in proportion to the total disturbance. 



the changed set-up conditions which affect the sensitivity of the SQUID sensors 
(and the experiment) rather than the rotation directly. 

The centrifugal force from the rotation has a tendency to destabilize the 
magnetic springs (from the position sensor) that hold the masses in place. This 
is the reason for holding the rotation to less than three times orbit rate; the 
springs are required to give a period of 1500 seconds or less to provide some 
margin of stability. In the frame of the satellite the centrifugal force is constant 
except possibly for small changes due to variation of rotation rate. The constant 
centrifugal force is easy to distinguish from EP signals. The centrifugal force 
disturbance due to changes in rotation rate w is (uj(5(uj) — AX + o;^d(Z\X))Cmrr, 
where AX is the distance from rotation axis and d() indicates the variation in 
a quantity. 5 {AX) represents the amplitude of mass motion about AX. Varia- 
tions in rotation rate 5(u;) change the centrifugal force for both masses, and are 
reduced in the differential mode by the common mode rejection ratio Cnirr- For 
Lo = —0.001 rad/s {^ -orbit frequency), 5(o;) O.Olu;, AX ^ 7.5 x 10“^ m, and 
5{AX) 1.3 X 10“^^ m at signal frequency, this disturbance is about 3 x 10“^^ 

m/s^. 
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DFC Reference accelerometer 


Systematic component at signal frequemy 




Disturbance 


m/sec*2 




Comment 


SQUID noise 


2.31 E-18 






acceleration equivalert to intrinsic noise 






SQUID temp, drtn 


9.41E-19 




□ 


regulation of SQUID carriers 


ihermai expansion 


8.U2h-iy 




—I 


gradient along LAC structure 


Dfferential Thermal expansion 


5.08E-23 






Radial gradent in DAC structure 


Nyquist Noise 


1.18E-18 




Z=l 


RMS acceleration equivalent 


Gas StreaminQ 


3.39E-19 




n 


decayina Gas flow, outaassinq 


Radiometer Effect 


9.07E-19 






gradient along DAC structure 


Thermal radiation on mass 


1.88E-22 






Radiation pressure, gradient 


Var. Discharge ur light 


3.48E-19 




] 


unstable sou'ce, opposite angles on masses 


[larth field leakage to GQUID 


4.24C-19 




3 


estimate for signal frequency component 


Earth Field force 


5.66E-23 






estimate for signal frequency component 


Penetration depth change 


3.29E-20 






longitudinal gradient 


Electric Charge 


8.86E-20 






Assumptions about rate 


Electric Potenfai 


5.85E-19 




□ 


variations h measurement voltage 


Sense voltage of^et 


3.77E-20 






bias offset 


Dragfree residual in dff. Mode 


2.16E-19 




D 


estimated from squid noise 


Viscous coupling 


2.01 E-23 






gas drag *■ damping 


Cosmic ray momentum 


3.39E-21 






mostly directed downvvard 


Proton radiation momentum 


3.05C-20 






^bidirectional, downward 


dynamic CM offset 
static CM orrset imit 






! 


vibration about setpoint, converted 
A/D saturation by 2nd harmonic gg 


1.70E-19 






Trapped flue drfl acceleration 


7.43E-23 






actual force fromintemal field stabiliV 


Trapped flue changes in squid 


3.50E-19 




LI 


apparent motion from internal field stabiliV 


S/C gradient + CM offset 


5.62E-31 






cavity gradient coupling to DFC residual of S/C 


rotation stability 


5.87E-24 






centrfugal force variation + offset from axis 


EtcsnIriciV subharm @si 7 ial trsq. 


1.68E-19 




D 


incluctes bandwidth imitations 










position sensor gap, mm 


1.00 




4000X Ortxt height 


common mode period 


1486 




a0086 sensor current, A 


differential mode period 


1142 




1.4E-09 CM distance, m 


S/C rotation per orbit 


-2.72E+00 






Total error 


1.45E-17 


RMS error 6.38E-18 rtyMc*2 



Fig. 10. Typical Errors for a 400 km Orbit 



The drag-free and attitude control system controls all six degrees of free- 
dom, including rotation. If uncontrolled, the satellite will rotate about some line 
through its center of mass rather than about the line joining the centers of the 
test masses. The attitude control system forces the satellite to rotate about the 
line through the centers of the test masses (the “rotation axis”), using linear 
thrusters to provide the necessary forces and torques. To do this, the system 
must apply a constant thrust in the rotating frame (with some perturbation 
from disturbances), directed toward the axis and proportional to the distance 
between the satellite’s center of mass and the rotation axis. Enough thrust is 
available for this if the satellite center of mass is within 10 cm of the rotation 
axis, but some thrust must be reserved for translational control. The satellite is 
reasonably symmetric about the rotation axis, and there will be a requirement 
to passively trim the center of mass to within less than 1 cm of the axis. This 
leaves 90% of the thrust available for translation. Lockheed Martin has facilities 
for measuring the center of mass of satellites (specifically GP-B) to within 2.25 
mm about the roll axis. Trimming to reduce the satellite’s products of inertia 
may also be needed. These can be measured in the same facility to within 5 
kg-m^ . 

For many disturbances, neither the rotation rate nor the signal frequency is 
a direct input to the result of the error analysis, or it has a small effect. For 
disturbances which depend directly on the orbit rate, the error analysis assumes 
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DFC Reference accelerometer 


Systematic component at signal frequency 




Disturbance 


m/secA2 




Comment 


SQUID noise 


2.9CE-18 






acceleration equivalent to htriiisic noise 


SQUID terrvx drifi 


1.07E-18 




■ 


regulation of SQUID carriers 


inermai expansion 


«.yyt-iy 




1 


gradient along Li^c aructure 


Dffe rent iai Thermal expansion 


5.08E-23 






Radial gradient ii DAC stucture 


Nyquist Noise 


1.10E-18 




■ 


RMS acceleration equivalent 


Gas Strearriia 


4.01E-19 




1 


decayinq Gas flow, oulaassinq 


Radiomet^ Effect 


8.47E*19 




1 


gradient along CMC structure 


Thermal radiation on mass 


1.75E-22 






Radiation pressure, gradient 


Var. Discharge uv light 


3.25E-19 






unstable source, opposite angles on masses 


Earth Field force 


2.23E-21 






estimate for signal frequency component 


Penetration depth change 


i4TE-20 






longitudinal gradient 


Electric Charge 


1.14E-18 




■ 


Assumptions about rate 


Electric Potential 


5.46E-19 




1 


variations in measurement voltage 


Sense voltage offset 


2.60E-18 






bias offset 


Drag free residual indrff. Mode 


lOOE-20 






estimated from squid noise 


Viecoue coupling 


4.22E-24 






gas drag * damping 


Oosrric ray momentum 


116E-21 






mostly directed downward 


Proton radiation momentum 


2.09C-10 






inidirectional, downward 


dynanric CM offset 


9.87E-19 




■ 


vibration about setpoht, converted 


static CM orrset imit 


1.95E-20 






A/D saturation by 2nd harmonic gg 


Trapped fliK drift acceleration 


a94E-23 






actual force from Internal field stability 


Trapped fliK c hanges in squid 


197E-19 






apparent motion from internal field stability 


S/C gradient + CM offset 


1.61E-32 






craviV gradient couping to DFC residual of S/C 


rotation stability 


2.45E-22 






centrifugal force variation * offset from axis 


EtcenIriciV subharm @signal freq. 


5.63E-20 






includes bandwidth linnitations 


position sensor gap. mm 


1.00 




500000 Orbit height 


common mode period 


1369 




0.0086 sensor current, A 


differential mode period 


1071 




1.8E-10 CM distance, m 


S/C rotation per orbit 


-2.00E*00 






Total error 


a45E-17 


RMS error 1.97E-17 mfsec*2 



Fig. 11. Effect of Rotation Choice 



they have harmonics at multiples of the orbit frequency; in many cases the 
amplitudes of these harmonics can be explicitly calculated, and in the others a 
conservative estimate is used. The frequency of disturbances and their harmonics 
influences the choice of rotation rates, which should not be multiples of the orbit 
frequency. 

The expected effects of a change in rotation from —2.718 times orbit fre- 
quency (the previous examples) to —2.000 times orbit frequency are shown in 
Fig. 11. The most apparent changes are a large increase in the disturbance from 
magnetic field penetration and smaller increases in electric charge effects and the 
momentum transfer from proton momentum. The magnetic field of the Earth 
changes direction at twice orbital frequency in orbit. Its fourier spectrum is made 
up of narrow lines at harmonics of the orbit frequency, and this particular choice 
of signal frequency coincides with one of them. Note that the narrow line width 
of regular disturbances such as magnetic field penetration is determined by the 
observation time, while their own structure determines the relative amplitude 
of their harmonics. Regular disturbances with short durations relative to the 
orbit, such as proton momentum transfer, are spread out over more harmonics, 
but have little energy away from multiples of the orbit frequency. Thus, small 
changes away from orbit harmonic frequencies may make a large difference in a 
disturbance. 
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The changes in electrical disturbances are related to the regular passage 
through the South Atlantic Anomaly. In this case the rotation rate of —2 times 
orbit frequency means that every third passage through the anomaly comes with 
the same phase. Because passage through the anomaly is short, its energy is 
spread over many harmonics, and the third harmonic changes the charge in time 
with the signal frequency. The relatively slow operating charge control system 
cannot correct the charge quickly enough to reduce the disturbance. 

A selection of possible roll rates, and reasons for selecting them or not based 
on our analysis, is given in Table 2. 

These two examples illustrate the importance of having an integrated model 
of the full system, in understanding the results. The effect of other errors can be 
similarly understood. Several of the most significant error sources are discussed 
below. 



Table 2. STEP Satellite Roll Rates 



Roll Rate, (0 
(units of the 
orbital rate, O),) 


Signal 
Frequency 
COs =cq,-co 


Error sources or Experimental 
advantages 


Purpose 


Comment 


2.718 


-1.718 


EP signal is not an integer 
multiple of (0„ 


EP measurement 


moderate (Og. 


2.142 


-1.142 


EP signal is not an integer 
multiple of (Oo 


EP measurement 


relatively low (O^ 


2.000 


-1.000 


EP signal frequency coincides 
with (Oo 


May be used for 
assessing the level of 
orbital frequency 
noise 


Avoid for EP measurements 


1.333 


-0.333 


EP signal has 1:3 relationship 
with orbital eccentricity- 
induced error sources 


May be used to assess 
the level of orbital 
frequency noise 


Avoid for EP measurements 


1.000 


0 


EP signal suppressed 
(DC) 


Measure noise sources 


Also suppresses errors from 
thermal radiation from Earth 
and Earth gravity gradients 


0.667 


0.333 


EP signal has 1:3 relationship 
with orbital eccentricity- 
induced error sources 


May be used for 
assessing the level of 
orbital frequency 
noise 


Avoid for EP measurements 


0.500 


0.500 


EP signal frequency is 
identical to satellite roll rate 
and Vi orbit rate 


May be used to assess 
errors tied to orbit and 
roll rate 


Avoid for EP measurements 


0.000 


1.000 


EP signal coincides with orbit 
frequency 


May be used to assess 
the level of orbital 
frequency noise 


Avoid for EP. 

Centrifugal force disturbances 
suppressed. 


-0.414 


1.414 


EP signal is not an integer 
multiple of (Oo 


EP measurement 


low (Os- 


-2.000 


3.000 


EP signal is at 3*^^^ harmonic of 
orbit 


Assess level of 
disturbances 


Avoid for EP measurements 


-2.142 


3.142 


EP signal is not an integer 
multiple of (Oo 


EP measurement 


high (Os 


-2.718 


3.718 


EP signal is not an integer 
multiple of (Oo 


EP measurement 


very high (4. 
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SQUID Noise 

A well designed experiment will be limited by sensor noise if not by a more 
fundamental disturbance. This is the case in STEP, where the very tiny intrinsic 
flux noise in the SQUID of 1 x 10“^° J /Hz is equivalent to an acceleration noise 
5a, of about 5 x 10“^^ m/s^/-\/Hz depending on setup conditions. This then is the 
limit of performance of the drag free control system, which cannot distinguish 
between the sensor noise and an actual drag variation. To compare the acceler- 
ations of two test masses to 10“^^ m/s^ in the presence of this (relatively) large 
disturbance it is necessary to make a differential measurement with common 
mode rejection no bigger than 10“^. This high common mode rejection 

is achieved by accurate manufacture of the accelerometers combined with the 
common mode adjustment procedure previously described. 

Measured STEP SQUID noise characteristics are used to model the accel- 
eration sensitivity in the STEP error analysis. The transfer function relating 
SQUID noise to acceleration sensitivity depends on readout circuit inductances, 
inductance derivatives, circuit setup currents, and test mass masses. See [6] for 
details of the derivation. The measured SQUID noise and analysis based on the 
baseline instrument design yield an acceleration sensitivity that meets the resolu- 
tion requirement of 2.7 x 10“^® ms“^ in a bandwidth of 8 x 10“® Hz. Figs. 12-14 
show the measured properties of GP-B SQUIDS, which are identical with the 
STEP baseline choice. 

The SQUID noise has consequences beyond the immediate acceleration dis- 
turbance. Two other systems depend on the acceleration performance of the 
drag-free system for their operation. These are the charge control system and 
the mass centering system. Because of the close dependence of the performance 
of these systems on acceleration noise, the overall performance of STEP may be 
degraded as a high power of the residual acceleration. 

As mentioned above, the smallest measureable charge in the proposed system 
depends on the drag free residual acceleration. The charge measurement system 
works (conceptually) by measuring the acceleration of each test mass in response 
to a dither voltage. Any residual satellite motion appears as an apparent charge 
M5a/E for a mass M, acceleration noise Sa, and average applied electric field 
E. E is limited to small values (« 0.1 volt per mm) by the requirement to reduce 
disturbances from variations in the applied potentials and induced charges on the 
test masses, so the acceleration noise is a limiting factor to the performance of the 
charge measurement. Charge on the test masses is controlled by a system derived 
from GP-B which uses ultraviolet light and a bias electrode to steer electrons on 
or off of each mass. Incorporated into an appropriate control loop, this provides 
the best possible control of the charge, limited by the charge measurement noise 
q = MSa/E. This produces two sorts of disturbance to the EP measurement, 
one proportional to (or 5a^), and one proportional to q times any electric field 
that may be present. 

The ability to center the masses in each accelerometer depends on the ac- 
celeration noise in the differential mode. The differential mode acceleration is 
used to monitor the center of mass separation, with the Earth’s gravity gradient 




240 



N. Lockerbie et al. 



SQUID Pll vt Tm« FV«: «m31b 




Fig. 12. Measured Output, in units of ^o, GPG Science Mission Qualified SQUID 
smSlb vs. time. 



driving it. The centering can be no better than when the error signal from grav- 
ity gradient is the same size as the noise in the differential mode, which is the 
residual satellite motion times the common mode rejection. The static miscen- 
tering measureable by this technique in principle is quite small; at orbit radius 
R = 6870 km, g is about 8.43 m/s^, and for observation time Tq = 100,000 s, 
and Cmrr = 10“"^, the measureable miscentering is ((To/27r)^/^i?(5aC'mrr)/5 ~ 
3 X 10“^^ m. This is not likely to be achieved because the masses’ common modes 
will be excited by the acceleration noise, to an average amplitude 5aj {2 tt « 
10“^° m, where Tf, is the radial period of the magnetic bearings. It is this latter 
displacement which is converted to signal frequency by mixing with the Earth’s 
gravity gradient at twice signal frequency, and causes the disturbance dynamic 
CM offset mentioned above. 



Thermal Expansion 

Thermal expansion in the instrument has two effects. It can change the shape of 
the test masses, causing them to couple more strongly to gravity gradients, and 
it can directly affect the measurement. 

Thermal contraction coefficients of the test masses must be known to ~10% in 
order that they will come out the right size at low temperature. Great accuracy 
of knowledge is not necessary because the contraction is a small part of the 
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Fig. 13. Power Spectral Density of Trace in Fig. 12, in units of ^o/%/Hz. 



total size. The important thing is that the thermal contraction be uniform for 
the whole mass. Differences in thermal contraction can mechanically warp the 
shape of the mass much more than the actual thermal contraction would suggest. 
Current thinking is that the test mass thermal contraction needs to be uniform 
to about 10“^ to prevent warping. If this is the case, apparent changes in position 
due to nonuniform expansion coefficient will be limited to less than 1.4 x 10“^® 
m, which would not be expected to recur at signal frequency. This offset is 
insignificant for the acceleration measurement. 

The concern then is about the warping of the test masses. This might change 
the carefully minimized gravitational coupling terms. The total thermal expan- 
sion of beryllium (for example) over the range 0-293 K is about a^e = 0.00131; 
the masses must be oversized by this fraction to come to the correct size when 
cold. If the expansion coefficient is anisotropic, and differs by 10“"^ in perpen- 
dicular directions, the diameter/length ratio will change by about 10“^ ase = 
1.3 X 10“^ upon cooling from room temperature, and the length (or diameter) 
will differ from its intended value by about 2 x 10“® meters. This is comfort- 
ably smaller than the smallest relevant tolerance in the manufacture, 0.3 fx or 
30 X 10“® meters. 

Much larger dimensional changes could occur if there are gradients in the 
expansion coefficient over scales about equal to the size of the mass, because of 
thermal bending. The worst case thermal bending would be the case of a bender 
comprised of two strips of material with differing expansion coefficients welded 
to each other. The deflection of such a bender is roughly L'^Sa/S where L is the 
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Fig. 14. Measured Output, in units of ^o, GP-B SQUID with Temperature Controlled 
Electronics. 



length, Sa the difference in thermal expansion, and S the thickness; Substituting 
worst case values from our test mass dimensions, L = 0.069415 m. S' = 0.0025 
m, 6a = 10“'^ aBe j the deflection will not exceed 25 x 10“® meters. This is also 
less than our smallest required tolerance. Coupon samples will be taken from 
the material during machining to verify that the expansion differs by no more 
than a specified amount. 

The most stringent mechanical stability requirement comes from considering 
the case that one position sensing coil substrate undergoes a dimensional change. 
Then the resulting change in the radius of the thin film pickup coil will change 
the pickup coil inductance and could thereby mimic a change in position of the 
test mass. 

The mechanical stability requirement on the accelerometers is set to limit 
the possibility of false differential acceleration signals at signal frequency. Ulti- 
mately, the acceleration measurement is a measurement of test mass position. 
The natural period of the longitudinal mode of test mass oscillation is ~1000 
seconds, so «27rxl0“^ rad/s. Therefore the readout system requirement of 
differential acceleration resolution of less than 4 x 10“^® m s“^ in a 6 x 10“® 
Hz bandwidth implies the differential position measurement resolution in this 
same bandwidth must be 4 x 10“® m/s“^/(27r x 10“^ s“^)^ « 10“^^ m. The 
position of each test mass is measured by a differential circuit with a pickup coil 
near both ends of each test mass belt. This makes the measurement relatively 
insensitive to changes in the test mass size and accelerometer length, and also 
greatly eases the dimensional stability requirement of the accelerometer housing 
along the direction of the symmetry axis. 
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Using the STEP baseline pickup coil design parameters we have modeled the 
dependence of inductance on coil radius and thin-film trace width and compared 
this to the dependence of inductance on test mass position. We require that the 
dimensional changes in the sensing coil contribute at most 10% of the required 
test mass position resolution and obtain a mechanical stability requirement of 1 
part in 10“^^ for changes modulated at signal frequency. 

Given the systematic temperature difference requirement of 500 /rK, the me- 
chanical stability requirement will be easily met using either quartz or sapphire 
material to construct the accelerometer housing, bearing, sense coil, and elec- 
trode structures. For example, Braginski and co-workers [3] report measurements 
of the linear expansion of sapphire at 4.5 K to be ~5xl0“^^ with a good 
fit to the predicted law in there measurement range of 4.5 - 15 K. Taylor 
and co-workers [7] obtained similar results for a sapphire Fabrey-Perot opti- 
cal cavity between 5 K and 77 K. Measurements of Heraeus engineering grade 
quartz for the Gravity Probe B program indicate a linear expansion of below 
1 X 10“^° at 77 K while the higher grade Herasil 1 Top has an even smaller 
thermal expansion. 



Helium Tide 

If the liquid helium used to refrigerate the experiment has a free surface, it can 
move under small influences such as the gravity gradient of the Earth (~10~^ 
g), which are harmonically related to the signal frequency. This moving mass 
would produce significant gravitational disturbances to the experiment, which 
by frequency conversion may appear in the signal bandwidth. This is a concern 
because the estimated disturbance might be several thousand times the pro- 
posed sensitivity and is essentially impossible to calculate reliably. Moreover, 
separating this disturbance by rotating the spacecraft may not work well since 
it is locked to the signal frequency, although its amplitude and phase could be 
varied. The disturbance would be expected to change its signature significantly 
during the mission, as the helium boiled away and the free surface changed its 
size and response characteristics. 

We have observed that the liquid-vapor interface in an aerogel nearly filled 
with superfluid helium II takes a shape that is independent of gravity over times 
up to 3 hours (Fig. 15). The most likely reason that the interface is fixed is that 
its position is determined by capillary forces. The studies were for nearly filled 
aerogel, since this is when capillary forces are weakest and least understood, 
and the distribution of helium in the aerogel is then most susceptible to gravity. 
Apparently, therefore, the helium II liquid- vapor interface will not change shape 
in response to gravitational forces of 1 g or less for any amount of helium in 
the aerogel. This makes aerogel a very promising candidate for helium control 
in STEP. Aerogel is expected to eliminate this disturbance, so that elimination 
by experimental procedures or data analysis is no longer necessary. Studies are 
continuing on aerogels and other materials to determine their suitability for this 
purpose. 
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f = 0.4 f=n.3 f=0.2 f=0 



Fig. 15. Images of an aerogel sample as a function of the filling fraction, at 1.9 K. 
The dark regions are caused by an increase in light scattering in regions from which 
helium has been removed. The lower density region is below the higher density region, 
indicating a weak influence of gravity on the helium distribution. Tilting the cryostat 
11 degrees had no noticeable effect. 



Patch Effect 

The patch effect is the variation of electrical potential on conducting surfaces, 
which can give rise to spurious couplings between the test masses and their 
housing. Work function differences can occur in polycrystalline metals due to the 
surface exhibiting random crystal orientations or due to non~uniformly adsorbed 
surface layers. We have recently re-evaluated this potentially disturbing effect 
and performed a trade between it and the mass — housing gap, an important 
dimension in the baseline design. 

We have performed a theoretical analysis of the expected patch effect forces, 
force gradients, and dissipation and noise effects, and characterised the grain 
structure of sputtered Nb films which will coat the test masses, and which form 
the bearing and position sensing circuits. A theoretical study of the forces and 
gradients of forces due to variations of electrostatic potential over the surfaces 
of two adjacent metallic surfaces has been given for the LISA gravitational wave 
experiment in [8]. Based on these studies we conclude that the patch effect will 
not present a significant disturbance to the STEP experiment. Therefore, the 
patch effect does not determine gap spacing tradeoff in the baseline accelerometer 
design, although it could become important for significantly smaller gaps than 
our current baseline of 100 /im. The error analysis includes the following effects 
to determine design tradeoffs: bearing magnetic fields, effects of trapped flux, 
capacitive forces and dissipation (in particular with non-zero charge on test 
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masses), the variation of inductance with distance of position sensing circuits, 
including its first and second derivatives, and gravity gradient couplings. 

A basic result for patch effect forces is as follows: If there is significant co- 
herence between the 2-dimensional potential distributions on both surfaces and 
there is significant power in the spatial spectrum of potential variations at wave- 
lengths comparable with the spacing between the surfaces, then the force gradi- 
ents are approximately e^AV^ /a^ where a and A are the separation and common 
area of the electrodes respectively, and V is the amplitude of the variation in 
potential. If all the spectral power lies at wavelengths much smaller than the 
separation, the forces and force gradients are reduced exponentially. If there is 
no correlation between the two distributions of potential then there will be no 
force or force gradient. Correlations between the distributions for clean surfaces 
will in general be determined by the distribution of surface grains, discussed 
below for Nb films similar to those for STEP. 

Differences in the parasitic stiffness coupling the test-masses to the space- 
craft, caused by patch effect, will not lead to a loss of common-mode rejection 
because they will be trimmed out during setup by adjustment of the detector 
supercurrents. The stiffness due to the patch-potential correlations must not 
change the resonant frequency by more than a factor 10“^. STEP can toler- 
ate potential variations of 0.3V and separations of 0.1 mm. The drag-free (or 
common-mode acceleration noise) requirement on the patch-field force gradients 
should be approximately satisfied by the current STEP accelerometer design ir- 
respective of the spatial distribution and correlation of the patch-potentials. 

Any variation in time of the forces and force gradients due to patch-fields 
will necessarily produce energy losses and will affect the quality factor of the 
accelerometer. We have shown that, provided the thermal noise is sufficiently 
low to satisfy the STEP science goals, the patch-potential variations will not 
produce significant differential forces on the test-masses. 

Perhaps the most likely way that the surface potentials can be modified over 
time and over distances large enough to cause a correlation is through the charge 
management system. The UV radiation will liberate surface contaminants as well 
as electrons. The work functions of both the test-mass and the levitation coil 
will be modified by the UV exposure (and the high energy protons in the SAA) . 
Some of these free ions will have low energies and will move in the electric 
fields generated by the patch fields near the metallic surfaces and presumably 
will occupy potential wells due to the patch fields. Changes in the electric field 
experienced at one surface due to the presence of the other will lead to a change 
in the population of adsorbate atoms and corresponding losses. The time taken 
for an adsorbate to diffuse will depend on the depth of the potential well in which 
it sits [2] . Such motion of adsorbates will give rise to losses in energy. Depending 
on the number density distribution of wells as a function of energy this will give 
rise to 1/f noise in a fashion similar to anelasticity in mechanical suspensions. 

If the patch fields are correlated initially, charge motion will take place in 
order to anti-correlate them as much as possible. After a time the stiffness will 
reduce by an amount AKp which is analogous to the lossy component of a 
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mechanical spring. We can put an upper limit on AKp using the requirement 
that the common-mode rejection ratio be stable to one part in 10 ^, AKp « 
where u> is the resonant angular frequency of the differential mode of 
the accelerometer with equivalent mass m. Estimates for AKp lead to a condition 
on uj. The spectral power density of thermal noise forces produced by fluctuations 
in the surface contaminants is expected to be = 4kT AKp/co, which must 
be less than 10“^° to achieve the STEP goal. In the worst case the 

thermal noise is generated entirely by motion of surface contaminants. Thus AKp 
must be less than 10~^°w/4fcr. Thus the common-mode rejection ratio will be 
stable enough provided iv/AkT. Using an angular frequency of 

to = 10 “^, we find that this inequality is satisfied with an order of magnitude to 
spare. 

Niobium thin films grown by sputtering tend to be very fine grained poly- 
crystalline materials. Niobium films of order a micrometer or larger in thickness 
become quite rough. These films and the oxide layer on them have been well 
characterized. One point to consider is that any niobium surface which has been 
exposed to the atmosphere will develop an oxide coating consisting of amorphous 
Nb 2 O 5 , which is an insulator, mechanically hard and stable, dense and well 
bonded to niobium. In STEP these films will have to be covered with an inert, 
thin, slightly conducting layer to prevent charge buildup. 

Fig. 16 below shows an example of a niobium film grown at Stanford for 
coating on quartz substrates. The small grain size suggests that any correlation 
of surface potentials will be due to overall texturing rather than specific details 
of the grains. 

6 Conclusion 

We have presented a simplified description of the STEP apparatus and proce- 
dures, followed by highlights of recent technical progress. The STEP accelerome- 
ter design has been advanced to a stage where we can build a prototype flight ac- 
celerometer based on previous working models. Particularly important progress 
has been made in a unified error model for the experiment and major distur- 
bances, and in practical solutions to some longstanding problems such as helium 
tide and patch effect disturbances. 
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and the average grain size is closer to 0.1 /rm. 
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Abstract. Free fall tests for proving the Weak Equivalence Principle (WEP) have been 
rarely be done in history. Although they seem to be the natural experiments to test the 
equivalence of inertial and gravitational mass, best results for proofs of the WEP could 
be attained with torsion pendulum tests to an accuracy of 10”^^. Pendulum tests are 
long term periodic experiments, whereas free fall tests on Earth can be carried out only 
for seconds causing certain limitations in principle. Nevertheless, very precise fall tests 
in the to 10~^® range are possible and under preparation to be carried out on the 

Drop Tower Bremen for a free fall over 110 m. These tests require position detectors 
with an extremely high resolution in order to measure tiny displacements of freely 
falling test masses. Using SQUID-based sensing technique, the displacements can be 
determined with an accuracy of 2 x 10”^^ m/\/Hz. The SQUID system, developed and 
manufactured at Jena University, provides high sensitivity and extremely low intrinsic 
noise, especially at low frequencies. Some recent results are discussed. 



1 Introduction 

The study of gravitational interactions is an important challenge for basic sci- 
ence. Many theories have been developed to describe the possible material de- 
pendent coupling of masses to gravitation, but up to now a corresponding ex- 
perimental proof is missing. The evidence of such a dependence is indicated by 
a violation of the Weak Equivalence Principle (WEP). This principle requires 
an exact equality between gravitational mass rUg and inertial mass my The 
experiments carried out to test the WEP may be classified into two groups: 

1. Torsion balance experiments and 

2. Galilean type (or drop tower) experiments. 

Torsion balance experiments carried out by Dicke [1] and Su et al. [2] placed an 
upper limit of 5 x 10“^^ on the so-called Eotvos-factor 

(mg/mi)A + (TOg/mi)B 

This is still the best proof of the validity of the WEP. 



C. Lammerzahl, C.W.F. Francis, and F.W. Hehl (Eds.): LNP 562, pp. 248-272, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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Although Galilean tests seem to be the easiest way to prove the WEP, the 
best experiments as performed by Niebauer et al. [3] and Kuroda and Mio [4], 
e.g., allow an accuracy of 5 x 10“^° only. In these free-fall experiments absolute 
gravimeters were mainly used adapted especially to this application. Although in 
these experiments the difference signal is detected, the measurement is absolute 
in a sense that the unavoidable time delay between the release of both test masses 
at t = 0 will cause a displacement As ~ g At t which can be larger by orders of 
magnitude than the expected signal due to a violation of the WEP in the aspired 
limit rj < 10“^^. On the other hand, the short free-fall time of 0.3 s only yields 
a very small gravity dependent relative shift in the position of the test body. 
The Drop Tower Bremen as an earth bound microgravity facility with a free-fall 
distance of 110 m and a flight time of 4.7 sec offers the unique possibility of a 
pseudo-Galilean test of the WEP in which a true relative measurement can be 
performed. This can be realized by integrating the measuring device into the 
free falling system so that displacements caused by the time delay become very 
small. The experiments cover the interesting and rarely measured distance range 
of up to 100 m and are ideal tests for the planned space experiments like the 
Satellite Test of Equivalence Principle (STEP). 

In this drop tower experiment as well as in the satellite STEP experiment 
of NASA/ESA, one wants to prove a possible violation of the WEP at a level 
of an accuracy up to one part in 10^^ and 10^®, respectively. High performance 
SQUIDs will be used as ultra sensitive superconducting position detectors for 
measurements of tiny displacements of test masses [5,6]. 

2 Experiment Description 

2.1 Drop Tower Bremen 

The Drop Tower Bremen is a 146 m high building located at the campus of 
the University of Bremen, Germany. Inside capsules weighting up to 500 kg, 
experiments can be carried out under conditions of weightlessness during 4.74 ses 
in free fall inside a 100 m high drop tube. The residual disturbing acceleration is 
reduced to 10“® to 10“^ m/sec^ (« 10“® to 10“®(/, where g is the local attractive 
acceleration of the Earth gravitational held) below the frequency range of 100 
Hz. This is achieved essentially by pumping down the entire drop tube inside 
the concrete tower to a vacuum pressure of 1 to 10 Pa. By using a special release 
system for the drop capsule, the rotation of the capsule during free fall can be 
minimized down to 0.5°/sec. To minimize the transfer of forces induced by wind, 
the drop tube is free-standing on a 2 m-thick concrete ceiling without any link 
to the surrounding concrete shaft. The deceleration chamber, 10 m in height and 
9 m in diameter, is integral with the tower foot. After free fall in the Earth’s 
gravitational held under vacuum the capsule’s velocity is about 46.5 m/sec; for 
breaking, the capsule falls into a tank 8 m high filled with 30 m® fine graded 
polystyrene. Although the impact seems to be a strong mechanical shock, the 
deceleration system in fact provides a very soft landing. The deceleration process 
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lasts about 0.2 sec with an average deceleration rate of 250 m/sec^, the maximum 
peaks do not exceed 500 m/sec^. 

Drop capsules are cylindrical in shape; 3 m high and 80 cm in diameter. At 
the front of each capsule, a cone 50 cm long is mounted to stabilize it during im- 
pact deceleration. The experiment is mounted on horizontal platforms inserted 
in a system of 4 vertical stringers. After assembly and final tests a vacuum-tight 
cover is slipped over the stringer structure. Normal pressure conditions inside 
the capsule are helpful, because additional provision for adaptation of the ex- 
perimental equipment to vacuum conditions can be avoided. The experiment 
platforms are of ’’sandwich” construction, made from wood covered by thin alu- 
minum plates, and have excellent damping characteristics. At the top of the 
tower, the capsule is connected by an umbilical cord and helium vent lines. Just 
before release, the umbilical is drawn and the vent lines become closed to avoid 
any disturbance to the freely falling capsule [7]. 

In optimizing the release process, not only the possible rotation of the falling 
capsule must be considered but also the disturbing structure vibrations induced 
during release. Rapid release causes oscillations of the structure along its cylin- 
drical axis. These oscillations, as a result of relaxing the stretched structure, 
do not depend on any properties of the structure’s material, which only influ- 
ences the natural frequency of the structure, and must be damped within ade- 
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quate time (ca. 0.15 sec). Beside release process and air drag already mentioned, 
some other physical reasons exist which limit the the possibility of attaining real 
weightlessness in the sense of total compensation of the gravitational force and 
disappearance of any disturbing acceleration. In [8], disturbance effects are clas- 
sified and listed. Nevertheless, carrying out a gravitational experiment at Drop 
Tower Bremen needs careful analysis of further disturbing effects which will be 
described below. 

2.2 Experimental Set Up and Timing 

The drop tower experiment yields to compare the gravitational accelerations 
acting on two test masses falling freely in the Earth’s gravitational field [9]. 
The test masses of different composition (e.g. niobium, aluminium, or lead) are 
hollow cylinders aligned along their vertical axes to place the centres of mass 
(CM) at the same point. Position differences between the test masses caused 
by, no matter what, will lead to a differential acceleration due to the Earth’s 
gravitational field, which could mask a possible material dependent difference 
of accelerations by magnitudes. Therefore in the ideal cylindrical constellation 
(both CMs the same), both cylinders suffer the same acceleration by the Earth’s 
gravitational field and their mutual gravitational attraction will become zero. 
The experiment is carried out in a small high vacuum container cooled down in 
a liquid helium Dewar vessel to a temperature of 4.2 K. Because the vacuum 
inside the tower’s tube is insufficient to attain a residual acceleration level of 
< 10“® m/sec^ (« lO^^g), the Dewar is fixed on a free-flyer platform with a 
diameter of 460 mm and a height of 1190 mm. Handling the entire experimental 
apparatus inside the drop capsule as a free flyer additionally released just after 
the drop capsule’s release, influences by air drag can be minimized. 

Both test masses become adjusted with an accuracy of 1 /rm by a position coil 
system at top and bottom of each test mass. After positioning which needs about 
0.5 sec only, the test masses are released inside the vacuum container by shut-off 
of the position coil system. Vacuum is needed to minimize mechanical noise due 
to Brownian motion of the gas atoms surrounding the test masses; the resulting 
drift velocities of < 10 /x m/sec during free fall are small enough to neglect air 
drag effects inside the vacuum container. Cylindrical meander coils surrounding 
the test masses, which have a superconducting coating, levitate them along the 
two axes perpendicular to the cylindrical symmetry axis. Levitation is based on 
the phenomenon of magnetic flux exclusion from the bulk of a superconductor, 
called the Meissner-Ochsenfeld effect. 

The tiny differential displacements and accelerations are measured by SQUID- 
based sensors. SQUIDs sense the relative displacement of the test masses as an 
inductance change with ultra high accuracy of 4 x 10“^^ m/-\/Hz. In principle, 
a pair of pick-up coils, wound in series opposition, are placed on either side of a 
thin superconducting diaphragm covering each test mass. When the pick-up coil 
system is connected across the input coil of the SQUID any small movement of 
the test mass results in an output signal of the SQUID. Each test mass acts as a 
superconducting tuning slug changing the inductance of the pick-up coils placed 
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in front of the top and bottom area of the cylindrical test masses. In the initial 
position a persistent current has to be trapped into a superconducting input cir- 
cuit with definite flux. A motion of the test masses modulates the inductivity of 
the pick-up coils and, since the flux in the superconducting circuit is constant, 
forces an additional screening current through the pick-up circuit which will be 
detected by the SQUID. Such high precision measurements need robust SQUIDs 
unaffected by repeated thermal cycling, with an extremely low noise especially 
at low frequencies, excellent long term stability of all parameters, and immunity 
to mechanical shocks, vibrations, and r.f. interference. 

After a drop time of about 4.3 sec, the experiment is finished by clamping 
and locking the free flyer platform and re-caging the test masses to avoid any 
destruction of the experimental equipment. 

2.3 Main Error Sources 

The accuracy of the experiment depends directly on the precision of the test 
masses’ alignment and orientation after release. Disturbances during release re- 
sult in initial displacements {Az yf 0) and initial velocities {Av yf 0) causing an 
acceleration difference of the test masses Ziugrad due to the Earth’s gravity gra- 
dient and the mutual gravitational attraction of both test masses, respectively. 
This relative acceleration may dominate the potential material dependent ac- 
celeration Z\owEP by orders of magnitudes. To elude this problem by taking 
into account the desired accuracy of the apparatus, boundary conditions for the 
experimental set-up can be derived in the following manner: 

1. Choose of proper cylinder materials (appropriate for machining), and 

2. Optimizing the geometry of the cylinders by keeping the contribution of 

Aograd clearly below Auwep- 

The initial values of the maximum acceptable relative displacement and ve- 
locity of the test masses are directly connected to a proper choice of the cylin- 
ders’ materials, with respect to their densities, and their geometry. With suitable 
values it is possible to compensate the differential acceleration of the inhomo- 
geneous Earth’s gravity field by the mutual gravitational attraction of the test 
masses. The motion of the free falling test masses is described by two coupled 
differential equations (assuming a motion only in 2 -direction), which have to be 
solved numerically: 

0-1 = —^^^earth ~ Gd^ f h £, 

J Zi- Z2 

02 = -a,<?earth ~ Gd, [ , (2) 

where 01^2 are the test masses accelerations, 21^2 their coordinates in fall di- 
rection, and pi ^2 their densities, respectively, dearth is the Earth’s gravitational 
potential, and G Newton’s gravitational constant. A material dependent accel- 
eration of the test masses is taken into account by adding e to the equation of 
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Fig. 2. Displacement Az{t) of the test masses in presence of a material dependent 
gravity force (e yf 0) taking into account an initial displacement = 10 fim and an 
initial velocity Svo = 10 fim/sec. 



motion of one test mass. The second term of eq.(2) is given by: 

d^<Plyi = GTTp,f{Lj,Zi- i = 1,2, i^j, (3) 

where L denotes the cylindrical length, ri„ and Tout the inner and outer radius 
of the cylinders, respectively, and p the density. is the gravitational poten- 
tial of the z-th cylinder. Eq. (3) shows that optimization is a pure geometrical 
problem and could be solved for each test mass pair individually. In practice, we 
choose only one geometrical configuration for all test mass pairs and try to com- 
pensate the incomplete compensation by perfect release and positioning of the 
test masses. For a detailed problem description, see [10]. If initial displacement 
and velocity do not exceed certain values of Azq and Z\uo (declined by the cylin- 
der design), fictitious position sensitive detectors with a resolution capability of 
Az ~ 10“^^ m show a motion of two test masses which do not interact gravi- 
tationally and move coincidently in a quasi-homogeneous gravity field. Hence, 
the differential motion looks linear and a WEP-violation should be observed as 
a relatively accelerated motion. 

Assuming a material dependent acceleration difference (s yf 0) of Aaja = 
10“^^ an approximation of the additional £-term is given by 



10-13 g^earth 



(4) 



where Mearth denotes the Earth’s mass. The numerical solution of the differential 
equation system (2) neglecting the cylinder-cylinder gravitational interaction is 
shown in Fig. 2. Acceleration difference and displacement are corrected as de- 
scribed above. To test the WEP with an accuracy of 10“^^ by use of optimally 
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designed test masses^ (see Fig. 3), the effecting resolution of the position sensing 
SQUID-based system has to be Az < 10“^^ m. Disturbances due to a non- 
momentum free release resulting in different initial velocities of the test masses 
may be acceptable if they do not exceed Avq = 10 /rm/sec. The maximum dis- 
placement tolerance of the CMs of the test masses is Azq = 10“® m without any 
loss in performance. These requirements are in accordance with our experimental 
results for positioning the test masses. 

Temperature drifts inside the 
vacuum container during the mea- 
surement would lead to a linear 
expansion of the test masses su- 
perposing the displacement mea- 
sure, and temperature gradients 
in the vacuum container may 
cause a non-symmetric temper- 
ature gradient inside the test 
masses effecting a gravity gradi- 
ent dependent influence on the 
free fall motion of the test masses. 

Considering the maximum ac- 
ceptable tolerance of the CM dis- 
placement of 10 /rm during free 
fall, temperature must be stable 
within an accuracy of ±0.1 K. 

Measurements showed that the 
temperature is increased by an amount of < 0.05 K only during the drop time. 
The temperature increase is correlated by a small He-pressure increase inside 
the Dewar vessel, because the exhausting valve has to be closed during free fall 
to avoid a reduction of the quality of the microgravity. Only a slight increase of 
the helium level (ca. 8.5 %) is observed. 

In addition, several experiments have been performed to measure the rela- 
tively strong magnetic field components (vertical). By (horizontal), and the 
vertical gradient dB^/dz inside the steel tube during free fall. The diameter 
of the pick-up coils connected across the SQUID input was 1 mm only. The 
magnetic field pattern observed (see Fig. 4) was caused by the welding seams 
and changing magnetic remanence of the tube segments. After shielding the 
experiment perfectly by superconductors, the disturbing magnetic field signals 
disappear. 

^ In inhomogeneous gravitational fields the acceleration of a test body depends not 
only on the position of its center of mass but also on its shape. The geometry of 
both test bodies were numerically optimized in such a way that these test bodies 
move with nearly the same acceleration in the inhomogenous gravitational field of 
arbitrary disturbing masses nearby provided, of course, that the centers of mass of 
the test bodies are equal. 




Fig. 3. Test masses: inner test body made of 
special lead alloy (left); outer test body made 
of aluminium coated with niobium (right). (For 
comparison: a coin of 1 Deutsche Mark at the 
lower right.) 
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Fig. 4. Magnetic field inside the drop tube (measured during a free fall experiment 
using a SQUID-based magnetometer). 



Principle limitations of the resolution and systematic errors of the SQUID- 
sensor and the sensing system are discussed separately below. 

3 SQUID Based Position Detector 

In order to realize the precision measurement of tiny displacements of free 
falling test bodies, highly sensitive SQUID-based measurement equipments are 
required. The Superconducting Quantum Interference Device (SQUID) is the 
most sensitive magnetic flux detector known today. The operation of SQUIDs is 
mainly based on two effects, observable only at low temperature in the presence 
of superconductivity: 

• flux quantization in superconducting loops and 

• Josephson effects. 

SQUIDs has often been referred to as a cryogenic or superconducting magne- 
tometer, and indeed the measurement of extremely small magnetic fields is one of 
the most important applications. However, the scope of the SQUID’s usefulness 
extends far beyond simple field measurements. Almost any low-frequency signal 
that can be converted into a corresponding magnetic flux signal will be detected 
at a greater level of sensitivity with a SQUID than with any other instrument. 

Applications of SQUID systems have ranged from the investigation of mag- 
netic and electronic properties of materials to biomagnetic research, and from 
the measurement of millikelvin temperatures to the detection and characteriza- 
tion of subsurface magnetic structures on land and at sea. In recent years the 
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Fig. 5. DC SQUID UJ 111. Chip fixed on a printed circuit board with contact pads. 



application of high performance SQUID measurement systems in fundamental 
experiments like the proof of the universal proportionality of inertial and gravi- 
tational mass is one of the most promising field in modern physics. 

3.1 The DC SQUID 

The magnetic flux sensor employed in this work is an eight-loop thin film 
DC SQUID UJ 111 [II] based on Nb-NbOa;-Pb/In/Au window-type Joseph- 
son tunnel junctions developed and fabricated at the Department of Physics of 
Friedrich-Schiller-University Jena. In contrast to other sensors the SQUID UJ 
111 was designed for universal applications in precision measurement technique 
and works at an extremely low noise level also in a magnetically unshielded 
environment. 

Each Josephson junction is resistively shunted by strips of Ag/In alloy having 
a resistivity of 2 17 at 4.2 K. The integrated input transformer consists of two coils 




Fig. 6. Simplified layout of the thin film DC SQUID UJ 111. See text for details. 
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Table 1. Parameters of the DC SQUID UJ 111 (flux quantum ^o=2. 07x10 Vs). 



Parameter 


Value 


SQUID inductance LsQ 


40 pH 


Input inductance Li 


0.8 /iH 


Junction shunt resistance R 


2 n 


Junction capacitance C 


« 0.6 pF 


Stewart-McCumber parameter /3c 


< 0.5 


SQUID parameter /3 _l 


1 


Junction critical current R 


3-30 /iA 


Number of turns in input coil N\ 


2 X 18 


Coupling coefficient k between SQUID and input coil 


0.9 


Input current sensitivity AIijA^ 


0.4 /xA/$o 


Modulation current sensitivity Aim/ A<1> 


20 /xA /^o 



of 18 turns each, connected in a gradiometric configuration, providing an input 
inductance of about 0.8 /xH. The one turn flux modulation coil is inductively 
coupled only to one half of the gradiometric SQUID loop system. The contact 
pads for bias current, voltage measurement, and flux modulation are made of 
niobium and are connected with the printed circuit board of the sensor using 20 
/im thick aluminium wire and bond techniques. Niobium wire (thickness: about 
25 /xm) is also used for the bonded interconnections between the input coil and 
the niobium input leads of the SQUID. The complete SQUID has been integrated 
on a 3 mm x 5 mm silicon chip and is placed on a printed circuit board with an 
overall size of 8 mm x 16 mm (see Fig. 5). 

A simplified structure of the integrated DC SQUID sensor, as described 
above, is shown schematically in Fig. 6. An overview on some important elec- 
trical parameters of the applied SQUID type is given in Table I. 

The long term stability of the parameters given during a time period of 
several years is remarkable although the SQUID described is not encapsulated 
hermetically. According to our experience there is no influence on the SQUID 
parameters even after more than 100 cooling down cycles. It should be pointed 
out that the SQUID sensors are also very insensitive to mechanical shocks tested 
by free-fall experiments over a height of up to IIO m causing a deceleration of 
up to 500 m/sec^ (50 g) at the end of the flight. 

3.2 SQUID Control Unit 

The SQUID electronics consist of the low noise preamplifier and the SQUID 
control and detector unit (see Fig. 7). The low source impedance of the SQUID 
(about 1 17) is stepped up to the optimum impedance of the preamplifier with 
the help of a matching transformer. The first stage of the preamplifier consists 
of two low noise junction field effect transistors (i.e. Toshiba 2SK146) coupled 
in parallel with a noise optimum source impedance within the range of 1 and 
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Fig. 7. Photo of the SQUID Control unit with preamplifier. 



100 kl7. Thus, the matching condition for low noise operation is well satisfied. 
The choice of the cables connecting the transformer with the SQUID and the 
preamplifier, respectively, is usually rather critical. Twisted pairs of teflon in- 
sulated copper wires were used to keep the contribution of current noise from 
the cables negligible. Each pair is installed in a grounded Cu-Ni-tube (diam- 
eter: 1 mm) preventing disturbances by RF interferences. The contribution of 
the SQUID preamplifier to the total voltage noise observed was negligible in all 
cases reported. 

The d.c. bias and fiux modulation current (/ = 125 kHz) are fed into the 
SQUID via voltage-controlled current sources situated in the preamplifier case. 
The amplification and detection of the SQUID signal is achieved by the state- 
of-the-art design (Fig. 8), i.e. the preamplifier is followed by an AC amplifier and 
phase sensitive detector with a Pl-type integrator. The output signal returns via 
a resistor RF to the modulation coil to close the feedback loop. Furthermore, 
the SQUID control unit has three different modes of operation: 

• the test mode, 

• the small signal measurement mode, and 

• the fiux-locked-loop mode. 

All experiments were usually performed with standard cryogenic equipment 
at liquid helium temperature. No special care was taken to stabilize the bath 
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Fig. 8. Block diagram of the DC SQUID Controller. 



temperature. The cryostat was especially developed for drop tower experiments 
and contains a maximum of 20 1 liquid helium. All parts of the dewar were 
made of nonmagnetic stainless steel. As radiation shields, layers of 25 nm thin 
semireflecting aluminium coated Mylar foil were used. In this configuration an 
operating time of more than 10 hours is available. 

The ground based measurements to test the performance of the SQUID mea- 
surement system and the position detectors were performed in a open labora- 
tory at the Friedrich-Schiller-University Jena. The low frequency noise spectra 
(Fig. 9) were measured with a Hewlett-Packard HP 3582A Fast Fourier Transfor- 
mation (FFT) spectrum analyzer using the Hanning window. The white noise 
levels were obtained by calculating the FFT. To attain the result in units of 
the flux quantum, the spectral density of the voltage was divided by the change 
of the output voltage corresponding to one flux quantum <Pq measured before. 
In noise measurements the SQUID was operated with a peak-to-peak modula- 
tion of <^o/2. In general, the optimum noise performance was obtained when the 
SQUID was biased slightly above the critical current. 

For an optimum choice of bias and flux modulation point, a white noise flux 
spectral density of 2 x 10“® <Po/V^ for the SQUID system was found when the 
antenna was not connected but the SQUID input coil was shunted. This flux 
noise corresponds to an equivalent current noise through the input coil of 0.9 
pA/\/Hz, an effective energy factor of 543 h {h = Planck’s constant), and an 
energy resolution of 3.6 x 10“^^ J/-\/Hz [11]. 

3.3 Detector Principle 

The main components of a SQUID position detector consist in a superconducting 
pick-up coil and a superconducting diaphragm covering the test body. When 
this pick-up coil is connected across the input coil of the SQUID and a definite 
current is trapped in this closed superconducting loop any small movement of 
the test mass results in an output signal of the SQUID. 

We started our investigations with a position detector using one pick-up coil 
only [12]- [14]. The basic scheme of an improved version of a position detector 
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frequency [Hz] 

Fig. 9. Low frequency noise of the Jena DC SQUID system with the sensor UJ 111. 
For this measurement the sensitivity of the SQUID system was adjusted to 10 V/^o- 

with two pick-up coils is shown in Fig. 10. The test mass changes the inductance 
of both pick-up coils L\ and L 2 . In the initial position a persistent current Iq has 
to be fed across heat-switch Si into the superconducting pick-up circuit causing 
a definite magnetic flux. During this procedure the input coil of the SQUID is 
decoupled by heat-switch S' 2 .^ The motion of the test mass modulates Li and L 2 
and, since the magnetic flux in superconducting loops is constant, the variations 
of the inductance of both coils force a screening current li corresponding to the 
displacement of the test body Ax to flow in Li. 

The circuit can be described by the following equations: 

I 2 =h- li (5) 

= L2I2 - L,h ( 6 ) 

=Lih + LJ, (7) 

Eqn. (5) comes from the current conservation. Equations (6) and (7) come from 
the flux conservation in two superconducting loops of the circuit, where <Pi and 
<Lii are the fluxes trapped in these loops. These equations correspond to the 

^ The heat switches consist of a small heater (a special non-magnetic resistor) and 
a thin superconducting wire (niobium) possessing a good thermal contact to the 
heater. The superconduting wire is connected with the remaining superconducting 
circuit whereas the heater is insulated electrically from the circuit and has its own 
power supplies (not shown in Fig. 10). If a current flows through the heater, the 
temperature increases and the superconducting wire becomes normal conducting. 
Since the remaining circuit is superconducting, no current flows through the normal 
conducting part. If the power supply of the heater is disconnected, the switch becomes 
superconducting again, and the circuit preserves its magnetic flux distribution. 
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equations for the voltage in room temperature circuits. The solution for the 
current li is 



<PiL 2 — ^iiLi 
Li{Li + L2) + L1L2 



(8) 



The sensitivity of the detector depends not only on the magnitude of the 
fluxes but also on their sign. In order to get a maximal sensitivity for displace- 
ments in the a;-direction and a minimal sensitivity perpendicular to x the fluxes 
and <P\i should have equal magnitudes and signs. Two switches are neces- 
sary to feed two defined fluxes into the circuit (Fig. 10). The current should be 
supplied across switch Si, when the test mass is centered (Ti = L 2 ). Assuming 
a linear ansatz for the inductance (corresponding to the dependence of plunger 
type coils) 

Li{x) = Lq + L' X , L 2 {x) = Lq — L' X , (9) 

and considering the initial conditions 

= LqIq , = LqIq , (10) 



we get for the current If. 

I = -^L'xIq _ -2 

* 2L, + Lo- (L'x)yLo ^ 2L, + Lq ’ 



( 11 ) 



where Iq is the current fed into the circuit at position x = 0. The dependence 
Ii{x) is linear for small displacements x. The ansatz (9) describes approximately 
the inductance of plunger-type pick-up coils in their optimal working range (see 
also section 4.1). In order to get a detector with a linear dependence A(x) within 
a larger range one has to use coils with a special dependence of the inductance: 



Li{x) 



1 

u + vx ’ 



L2{x) 



1 

u — vx ^ 



(12) 



where u and v are arbitrary parameters. However, it is difficult to produce wire 
wound coils possessing such a dependence over a wide working range. 
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According Eqn. (11) the detector measures the absolute position from a fixed 
reference point, where li = 0. However, this reference point is not the centered 
position of the test body but depends on the position of the test body at the time 
the flux was fed into the circuit. Therefore, the reference point varies slightly 
each time a new current is fed into the detector circuit. Therefore we get a high 
accuracy of measurement of relative displacements whereas the accurracy of the 
absolute position depends on the reproducibility of the supplied fluxes and is 
much lower. 

The current A is measured by the SQUID. As we described above we use 
a flux locked loop electronics for readout. This electronics react to the detector 
circuit by an integrated feedback coil. As long as the feedback loop of the SQUID 
electronics is closed the reaction can be taken into account by using a smaller 
effective inductance for Li instead of the real inductance of the SQUID input 
coil. This corresponds to the behaviour of a transformer where the effective 
inductance of the primary coil depends on wether the secondary coil is open or 
bypassed. But note that the problem becomes more complicated if the feedback 
loop of the elektronics is opened for a moment. 

The most important advantage of this detector, compared with a detector 
with a single pick-up coil, consists in the fact that the current /q is not limited 
by the critical current of the input coil of the SQUID [14], as long as x will be 
small. According to 13, a higher current /q will provide a much better resolution 
5x of the detector: nr , t i i 

& = 1 i , (13) 



2kyjLiLa,Q /q 01 



where Lq is the inductance of both pick-up coils in the initial position of the test 
body (middle position), k is the coupling factor between input coil and SQUID 
loop, Lsq is the SQUID inductance, 6<PsQ is the fiux resolution of the SQUID, 
and a = \dLi/dx\ = \dL 2 /dx\. 

In order to characterize a detector or a converter it has to be distinguished 
between sensitivity and resolution. The sensitivity is the ratio of the output 
signal (e.g. the output voltage of the SQUID electronics) to the input signal 
(e.g. the displacement). The resolution is the smallest difference of two values 
which can just be distinguished. Often, the resolution is limited by the noise of 
the detector. In this case, the absolute resolution can be increased by averaging 
over a long time. Therefore, the so-called noise limited resolution is used to 
characterize these detectors. This is the product of absolute resolution and the 
square root of the averaging time (one divided by square root of measurement 
bandwidth). For instance a position resolution of Sx = 10“^^ means: 

• If you measure the position of a test body at two different times by averaging 
the signal over 1 s per measurement point (or by using a low pass filter of 1 
Hz), then you can dectect displacements of 5x > 10“^^ m 

• If you average over a time period of 10 ms per measurement point, the 
smallest detectable displacements is 10“^^ m only. 



If the signal changes during the averaging time or the noise of the detector is 
not white, the problem becomes more complicated. 
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Fig. 11. Differential detector with two test masses. 



The resolution of a detector depends on its sensitivity. A good resolution can 
only optained, if the sensitivity is sufficient high. However, an increasement of 
sensitivity (e.g. by amplification of the output voltage) not necessarily improves 
the resolution, because both signal and noise will be increased. 

The position detector interacts mechanically with the test mass because each 
pick-up coil acts like a small magnet which repels the superconductive covered 
mass. For the centered position of the test mass both forces compensate each 
other. For this position we can calculate a spring constant by twice differentiating 
the energy in the circuit: 

^ _ 1 (j>i + 

2 L 1 L 2 + L\Li + Li2Li 

For most coils the spring constant is positive but for certain geometries also 
negative spring constants are possible. For plunger-type coils negative spring 
constants can occur if both ends of the test mass are completely inside the two 
pick-up coils. After a small displacement one coil pushes the test mass further 
into the other coil. The gradient of the gravitational field of the earth causes an 
effect which can be described by a negative spring constant. The resulting total 
spring constant can be compensated by additional levitation coils. 

A second important parameter of the detector, besides the resolution, is the 
dynamic range. This means the ratio of working range to the smallest signal 
which can be detected. The SQUID works up to its maximal input current of 
2 mA (corresponding to about 5000 <Po) without any loss of sensitivity. Above 
this current some parts of the input coil become normal conducting. The usable 
working range is limited by the SQUID electronics. Because the SQUID is de- 
signed for precision measurement, it has only a small integrated feedback coil. 
The maximum working range of the SQUID electronics (±100 ^ 0 ) is limited by 
the relatively high feedback current needed. Even smaller working ranges of 1 
or 10 ^0 will be used in the final experiments due to certain advantages (faster, 
less noise). With a resolution of 2 x 10“® ^o/VHz for our best SQUIDs we get 
a dynamic range of about 10®. This corresponds to the dynamic range of our 
24-bit sigma-delta A/D converter (121 dB at 500 Hz). 
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Fig. 12. Dependence of the inductance of a pick-up coil on the position of the test 
body. See text for details. 



During the measurement time of about 4 sec the test mass should not leave 
the working range of the detector. For a detector with a resolution of 10“^^ m 
the velocity of the test mass should not exceed 0.25 /rm/sec. Before the release all 
parts of the capsule are tensed by their own weight. After the release the struc- 
ture contracts and accelerates the test mass to a velocity of about 1 cm/sec. 
Therefore the test mass has to be fixed during the release of the capsule and 
after the vibrations have faded away it has to be placed in the centered posi- 
tion. This can be done by mechanical means (piezo elements) or with additional 
suspension coils. In our recent experiments we measure the displacement of two 
simultaneously free falling test masses. A violation of the WEP corresponding 
to an Eotvos factor 77 0 will cause a displacement Ax between the two test 

masses during the fall. If both test bodies start with the initial velocity of uq = 0 
then this displacement is: 

Ax = r]So, (15) 

where Sq is the drop length. 

The vertical gradient of the gravitational field of the earth causes a displace- 
ment of bodies if their centres of mass are not at the same height. Therefore the 
centres of mass of the two test bodies have to be in coincidence with an accuracy 
of < 1 /rm. The position detectors of the two test bodies can be combined in a 
circuit [15], as shown in fig. 11. The current in the SQUID input coil is: 

j ^ - ^2^2"^ + <^3^3 ^ - <^4^4 ^ 

' + 1 ’ 

where {m = 1, . . . ,4) is the flux in the loop consisting of pick-up coil 
and the SQUID input coil Li. It is possible to balance the circuit by adjusting 
the ratio of the two currents fed into the circuit (across switch Si and S'4, re- 
spectively). For such a balanced detector the current A would be proportional 
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0 2000 4000 6000 

Distance (urn) 

Fig. 13. SQUID system output U{x) in dependence on the position of the test body 
for various currents 7o in the pick-up circuit. The numbers at the curves indicate the 
amount of the current Iq in mA. 

to the displacement {xi — X 2 ) of the two test masses, but only for small dis- 
placements xi and X 2 - The detector can be balanced for all positions x\ and 
X 2 using inductance characteristics corresponding to (7), but geometries having 
such dependencies in a large range are difficult to realize. 

4 Experimental Results 

4.1 Inductance Measurements 

Beside theoretical determinations of Lq and dL/dx we used a special experimen- 
tal equipment to measure the inductance of a pick-up coil in dependence on the 
distance of a cylindrical test body with the help of a commercial LCR meter. An 
example of the characteristics measured is given in Fig. 12. In this case the coil 
was hand- wired from a 0.3 mm niobium wire applying 8 turns. The geometry 
of the coil and the body was a plunger-type one. The inductance in the initial 
position {x = 4000 /xm) is 1.2 /xH and a is determined by differentiating the 
curve in Fig. 12 to be 0.45 nH//xm. These results are in good agreement with the 
expected values. 

4.2 Performance of the Detector 

The performance of the DC SQUID position detector was investigated in ground- 
based experiments. Therefore a special measurement equipment allowing us to 
move the test body relative to the pick-up coils was used. It consists of a cylin- 
drical test body with superconductive coated jacket and the two pick-up coils 
at each end. In order to determine the sensitivity (3 = dU/dx of the detector nu- 
merous characteristics U{x) for different currents Iq in the pick-up circuit were 
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recorded. Fig. 13 shows an example for these characteristics. The unusual high 
voltages come from editing the curves to avoid the reset branches of the SQUID 
controller if the output voltage exceeds the maximum value by shifting segments 
of the curves in the voltage direction. According to Eq. (13), the sensitivity in- 
creases with higher currents in the pick-up circuit. Differentiating these curves 
one can get the maximum sensitivity which also increases with the current Iq. 
This dependence is shown in Fig. 14. 

On the basis of the measured sensitivity, the noise limited resolution of the 
detector (Fig. 15) can be specified using the fiux resolution 5<l>sQ of the SQUID 
and the transfer function dU/d<P of the SQUID system. This correlation is ex- 
pressed by the following equation: 



6x = 



1 dU 
/3 d<P 






(17) 



Inserting the fiux resolution of 6 x 10“® (including some environmental 

noise, in contrast to the value 2x10“® ^o/'x/Hz described in section 3.2) and the 
fiux sensitivity of 2.5 V/d>o of the SQUID system used one obtains a position 
resolution of 

5x = AxIQ-^^^L= (18) 

vHz 

assuming a current I = 200 mA through the pick-up circuit. For comparison, 
the radius of a uranium nucleus is in the same order of magnitude. 



4.3 Ftee Fall Measurement System 

After performing many free-fall experiments with a simplified system having 
only one single test body, a new system with two test bodies was completed and 




Current (mA) 



Fig. 14. Sensitivity of the SQUID position detector in dependence on the current Jq 
in the pick-up circuit. 
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has been successfully tested in first free-fall tests. Fig. 16 represents a simplified 
drawing of the new two-body system. The two concentric test bodies (a) and (b) 
are made of lead and aluminium (coated with niobium), respectively. They are 
stabilized in the radial direction by meander-shaped coils and can be controlled 
in the vertical direction by levitation coils. These coils are supported by special 
carriers (c), (d), (e). For instance, (f) denotes the upper levitation coil for the 
outer test body and (i) the lower levitation coil of the inner test body. The pick- 
up coils required for the position measurement are also attached close to the 
test bodies. For each body two pick-up coils are needed as described above, (h) 
shows the lower pick-up coil of the inner test body and (g) the upper pick-up coil 
of the outer test body. Furthermore all parts are shielded by superconducting 
screenings (j). The whole system is housed in a vacuum chamber immersed in 
liquid helium at 4.2 K. 

The cryostat (20 1 liquid helium), the SQUID electronics, the data processing 
unit, the control unit and the power supply are mounted in a special structure, 
the so-called free-fiyer (see Fig. 17), which resists the vibrations and shocks at 
the beginning and at the end of the drop. The free-fiyer itself falls freely inside of 
the outer fall capsule, which contains a computer-controlled caging mechanism. 
Despite the weightlessness and the rapid deceleration of about 500 m/sec^ (50 
g) after numerous drop experiments neither the cryogenic equipment nor the 
SQUID electronics have been damaged. 

4.4 Ftee Fall Tests of the Measurement System 

Experiments were done to improve the test body position control unit. This unit 
fixes the test bodies at the beginning of the drop. Before release, the hanging 




Fig. 15. Resolution of the position detector in dependence on the current 7 q in the 
pick-up circuit. 
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Fig. 16. Simplified assembly drawing of the cryogenic part of the measurement system 
for testing the Equivalence Principle including two test bodies, levitation coils, and 
pick-up coils for position measurement. See text for further explanation. 
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Fig. IT. Scheme of drop capsule (left) and photograph of free-flyer (right). 



capsule is under tension from its own weight resulting in a contraction upon 
release. This would result in an upwards acceleration of the test bodies. After 
vibrations of the system have faded away, the control unit has to center the test 
bodies in their working position as fast as possible to attain a long measurement 
time for determination of the Eotvos coefficient. For this purpose the control 
unit measures the position and the velocity of the test bodies and sends short 
rectangular shaped current pulses of a calculated length and amplitude through 
the upper and the lower levitation coils, respectively. These coils situated at both 
ends of each test body can repel the superconducting test bodies. Theoretically, 
two pulses from opposite directions (upper and lower levitation coil, respectively) 
are sufficient to attain centering even given an initial velocity. However, after the 
two pulses there are small deviations from the aspired position and velocity due 
to errors in determination of the initial velocity and position or due to limitations 
of the control unit. Therefore, it will be necessary to repeat the procedure several 
times. For precision experiments the final velocity of the test bodies relative to 
the measurement system should be lower than 0.25 ^m/sec to avoid exceeding 
the working range of the position detector (see section 3.3). 

Fig. 18 shows a preliminary result of a free-fall test for the position control of 
the inner test body. At the beginning of the drop the test body was magnetically 
fixed by a current of 1 A through the upper levitation coil. The following test 
sequence starts with a current pulse of 0.5 A through the lower levitation coil 
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time [s] 

Fig. 18. Controlling of the inner test body by short pulses through the levitation coils 
during a drop; SQUID Signal (upper curve, 14 V = 1 mm displacement); currents of 
the levitation coils (lower curves) measured by means of current-voltage converters 
(1 V = 1 A). 



accelerating the test body. After 40 msec the test body is decelerated again by 
a pulse through the upper coil. Both pulses have the same length (30 msec) and 
amplitude. Despite of same currents and simular geometry the upper and lower 
levitation coils cause slightly different forces on the test body. Therefore, the 
test body has not been stopped by the second pulse of the same length but has a 
residual velocity. This velocity is measured by the control unit and a short pulse 
is send to the lower levitation coil stopping the test body as it is clearly seen in 
Fig.l7 {t > 1.92 sec). 

After both test bodies have been stopped with a sufficient precision in the 
final experiments, the sensitivity of the detectors have to be switched to a higher 
sensitivity and resolution (but smaller working range) and the determination of 
Eotvos coefficient can start^. The position control unit is switched off for the 
rest of the drop. 

If a higher sensitivity is used, one would see that the test bodies don’t stay 
in rest after the controlling process, as it seems in Fig. 18, but appear as small 
(< 10 “®( 7 ) acceleration caused by the residual acceleration of the drop capsule. 
This acceleration should be equal for both test bodies if ry = 0. Beside the 
acceleration, the test bodies have a different initial velocity caused by limitation 
of the controlling process at the beginning and an unknown initial displacement. 

In order to determine the Eotvos coefficient, the displacement of the two test 
bodies has to be measured from the end of the controlling process to the end of 

® It is also possible to use two different sensors for each test body. One sensor possessing 
a large working range (0.5 mm) is used for controlling the test body. The other 
possessing a high resolution is used for the determination of the Eotvos coefficient. 
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the drop^. We expect a linear dependence of the displacement on time (caused 
by the different initial velocities) (if ry = 0), or a parabolic dependence (if i] 0). 
Theoretically, three measurement points would be sufficient to determine rj. But 
the measurement curves will be superimposed by the noise of the detector and 
the environment (magnetic disturbances, vibration of the capsule). Therefore, it 
will be better to measure much more points to get the highest possible resolution 
for t] limited by the noise spectrum and the measurement time. The sampling rate 
of the measurement point needed to attain an optimal resolution of rj depends 
on the bandwidth of the detector. Both parameters are chosen on the basis of 
the dynamical performance of the A/D-converter, the memory capacity of the 
data recording unit and the measurement time. In the final experiments we will 
use a 24-bit A/D converter® with a sampling rate of 2 kHz and a high-order 
lowpass filter with a cut-off frequency of 500 Hz. 

5 Conclusions 

Due to their exceptional sensitivity and universality, DC-SQUIDs made of clas- 
sical low-Tc materials like niobium and lead can be applied in position detec- 
tors providing an unusually high position resolution. On the basis of a set of 
two wire-wound plunger-type pick-up coils, the position of a superconducting 
test body could be measured with a noise limited resolution of the order of 
4 X 10“^^ m/-\/Hz. Using this sensor technique, gravitational experiments can 
be performed such as the test of the validity of the WEP on earth by using drop 
tower facilities or in space like the current STEP project of NASA/ESA. 

On the basis of this type of position detector, it would be possible to prove 
the validity of the WEP with an accuracy of about rj = 10“^® on earth by using 
the drop tower facility in Bremen. Because of disturbing effects (such as magnetic 
background fields, mechanical vibrations of the drop capsule after release, liquid 
helium tides, etc.) the accuracy will be limited to an estimated level of ry = 10“^®. 
Nevertheless, this would be more precise by at least one order of magnitude than 
the most accurate ground based experiments performed by Roll, Krotkov, and 
Dicke [1,2] using modern torsion balance equipments. 

On the other hand, drop tower facilities like the Bremen Drop Tower seem to 
be an appropriate and excellent tool for the ground based experimental test and 
characterisation of all components of SQUID based position detector systems as 
required, e.g., for the proof of the validity of the WEP in space. 
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Abstract. In view of space missions, for accurate recovery of the Earth gravity field, 
for the test of the equivalence principle and for the observation of gravity waves in 
particular, specific inertial sensors are developed exhibiting very high resolution and 
limited full scale range snited for in orbit operation. These sensors are constructed 
around a high density proof-mass with a very fine and stable silica gold coated core. The 
proof-mass position and attitude are measured with highly sensitive capacitive sensors 
and are controlled with electrostatic actnators. The configuration and the major design 
parameters of these instruments are described in relation to the expected performances. 
The present status of the development of these instrnments is shown together with the 
associated space mission scientific objectives. The main experimental results obtained 
during the ground qualification of these accelerometers are also presented. 



1 Introduction 

Space missions are not only dedicated to the discovery and the observation of 
the universe, as in astrophysics or in planetary sciences, but space provides 
also a specific microgravity environment, the advantage of which is exploited in 
crystallography, biology, and physics experiments. Moreover, it provides a fine 
possibility of global and accurate observation of the Earth. 

For the micro-gravity experiments, the acceleration levels of the space plat- 
forms have to be monitored especially in the very low frequency range from 1 Hz 
down to the orbital frequency of about 1.7 x 10“^ Hz, and with a sensitivity bet- 
ter than the micro-g. Such a sensitivity must even be increased for the control of 
dedicated isolated bench [1] or for the control of spacecraft. The DC acceleration 
and the vibration levels on board the Spacelab shuttle have been monitored [2] 
in orbit to less than 10 micro-g (1 g ~ 9.8 m/s^) mean value, depending on the 
orbit altitude between 300 km and 500 km and on the orientation of the shuttle 
with respect to its velocity vector. The fluctuations in the 1 Hz bandwidth reach 
100 micro-g and time to time 1 milli-g while the local vibrations increase at 
upper frequencies up to a hundred of milli-g. This is far from the simple idea of 
punctual weightlessness. In fact, the spacecraft is not only submitted to the or- 
bital gravity field but also to the atmospheric drag of the Earth (or the planet), 
the radiation pressures from the Sun and the Earth, the magnetic forces induced 
by the coupling between the planet’s magnetic field and the spacecraft charge 
(Lorentz force), or its magnetic susceptibility or momentum. Furthermore, when 
not at the centre of mass, the effect of the gravity gradient tensor has to be 
taken into account as well as the angular and the centrifugal acceleration. 
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Table 1 provides evaluation of the amplitudes of these accelerations for two 
recently designed satellites, the first one at very low altitude for a geodesy mis- 
sion, the second one at higher altitude for space fundamental physics. Obviously, 
at 36 thousand kilometres, the geosynchronous satellites are no more affected by 
the atmospheric drag and the Earth’s gravity gradient. 

For what concerns the International Space Station, it is designed to provide a 
space laboratory micro-gravity environment limited to 2 fj,g for frequency lower 
than 0.1 Hz and with a / law increase for upper frequencies [3]. 



Table 1. The Esa GOCE mission is devoted to the Earth gravity field recovery (launch 
foreseen in 2004) with a 800 kg satellite; the ONES /rSCOPE mission aims at the 
Equivalence Principle test (launch foreseen in 2004) with a micro-satellite of 120 kg; 
both missions require a very fine orbit and altitude control. 





GOCE satellite 


/iSCOPE satellite 


altitude 


250 km 


700 km 


atmospheric drag 


1.5 X 10“® m s"^ 


6 X 10“® ms"^ 


radiation pression 


6.1 X 10“® m s“^ 


3.7 X 10"® m s"^ 


gravity gradient 


4.1 X 10~® m s"Vm 


3.4 X 10“® m s"Vm 


angular acceleration 


1.4 X 10~® m s"Vm 


1.1 X 10“® m s"Vm 


centrifugal acceleration 


1.5 X 10“® m s“^/m 


3.3 X 10“® m s“^/m 



In order to finely measure these environments, specific sensors exhibiting very 
low bias and high resolution, hundred times better than the accelerometer used 
in aircraft for inertial navigation have to be developed. 

For industrial applications, accelerometers exhibiting a full scale range of up 
to several hundreds of g and exhibiting parts of g sensitivity are available [4]. 
The piezoelectric accelerometers exploit the electrical potential variations of a 
piezoelectric device which internal constraints are linked to the relative motion 
of an inertial mass. These very robust and small instruments present very large 
bandwidth but no measurement at low frequencies. 

For aircraft navigation, the accelerometer accuracy has to be better than 
one hundred micro-g, after data processing taking into account the instrument 
calibration and thermal sensitivity, in the important range of temperature from 
— 50°C to -1-90° C [5]. Most of these accelerometers rely on a spring-mass or 
a pendulum-mass device with a position sensing of the mass. When the mass 
motion is servo-controlled through magnetic actuators for instance, the acceler- 
ation of the instrument frame is deduced from the measured applied force. The 
stability of the spring or of the hinge which sustains the mass is of peculiar im- 
portance in order to obtain the accelerometer bias stability [6]. The servo-loop 
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operation leads fortunately to one single point condition of operation (deviation 
and constraint) to the benefit of the behaviour stability. 

New generation of vibrating beam accelerometers are now proposed for air- 
craft navigation. The natural mode of vibration of a silica or a quartz beam 
varies with the applied constraints which are induced by a mass fixed at one 
of its extremity. The vibrations are excited by piezoelectric or capacitive means 
and the oscillator frequency is measured with a counter. These accelerometers 
exhibit now better than one hundred micro-g accuracy in a large operating range 
of temperature while the resolution can be in the order of one micro-g. The main 
interests consist in their easy way of production, few cm^ sizes and the already 
digital outputs [7]. 

All these accelerometers can be adapted to some space applications but 
present for the quasi-steady acceleration a non sufficient resolution because of 
their too important full range and bias, not suited to the in orbit acceleration 
levels. 

Furthermore, scientific missions need much better performance. In the field of 
the Earth’s observation, two main applications of the ultra-sensitive accelerom- 
eters merge. First, the measurement of the surface forces acting on low Earth 
orbit satellites allows to eliminate the non gravitational effects in the accurate 
determination of the orbit. This can be simply exploited for the autonomous 
navigation of the satellite, the orbit of which is then predictable. Associated to a 
fine trajectory recovery from GPS receiver or Doris on board system, this yields 
to the determination of the Earth’s gravity field as in the CHAMP project or 
the DORIS D micro-satellite project [8]. For the determination of the gravity 
anomalies at smaller geographical scale, from wave lengths of several thousands 
down to one hundred kilometres (and so of the higher spherical harmonics of 
the gravity potential) two other techniques are now elaborated, the satellite to 
satellite tracking and the gravity gradiometry [9], [10]. For both, accelerometers 
with expected resolution up to the pico-g or better are necessary [11]. 

Future scientific missions in the domain of Fundamental Physics require also 
satellites flying drag-free, which means that all non-gravitational disturbances 
acting on the satellite are compensated by throttling the thrust with closed-loop 
controls exploiting acceleration sensors. This is the case of the LISA mission 
satellites for the observation of the gravity waves [12]. The in orbit test of the 
Equivalence Principle is also proposed by considering ultra-sensitive differential 
accelerometers including two test-masses made of different materials [13], [14]. 
For these missions, the objectives of sensitivity are much better than the femto-g. 

These outstanding resolution cannot be achieved with instruments able to 
measure or even support in their nominal measurement mode one g. Specific 
space accelerometers must be conceived taking advantage of the in orbit en- 
vironment that much reduces the applied acceleration to sustain especially on 
board future drag-free satellites. 
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2 Accelerometers Dedicated to Space 

Aircraft pendulum accelerometers have been optimised for space applications; 
this is the case of the sensors of the SAMS system which has been used to 
characterise the shuttle microgravity [15]. In fact, the sensor core is quite not 
modified but the servo-loop electronics of the mass control can be changed in 
order to increase the accelerometer scale factor from 0.1 V/g to 100 V/g. The 
resolution is then modified from 0.5 /ig/\/Hz to about 50 nano- g/VSz as seen 
in Fig.l. 




0.01 0.1 1 10 Hz 

Fig. 1. The pendulum accelerometer is mounted on an antiseismic testing bench; the 
residual acceleration of the bench, which has a tilt natural mode around 0.3 Hz, is 
measured with the ASTRE space accelerometer (presented below) to less than Inanog 
for frequency lower than 0.1 Hz; the intrinsic noise of the QA 3000 is run printed out. 



To go further in resolution, electrostatic space accelerometers have been de- 
veloped since the seventies. The Discos and the Cactus instruments have been 
respectively launched in 1972 and 1975 for drag compensation control or drag 
measurement [16], [17]. Both was designed around a high density proof-mass, 
the motion of which is measured with three capacitive sensors. The Cactus ac- 
celerometer (Capteur Accelerometrique Capacitif Triaxial Ultra Sensible) was 
designed to measure the surface forces exerted on a satellite by the aerodynamic 
drag and the solar radiation pressures. The spherical proof-mass position is con- 
trolled but not the rotation. This simple design is well suited for high sensitivity 
but incompatible with high accuracy: no matter how carefully the instrument is 
built, the residual geometric or electric defects of sphericity make the sensitivity 
of the accelerometer fluctuates as the proof-mass turns. 

The Miniature Electrostatic Accelerometer (MESA) includes a thin-walled 
cylinder as a proof-mass with a thin central flange and operates in various space 
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missions [18]. Initially developed by Bell Aerospace as a one-axis accelerometer 
for rotating accelerometer gravity gradiometre, the MESA is presently exploited 
as a three axis space accelerometer taking advantage of its five degrees of freedom 
electrostatic “suspension”. It is in particular one of the sensors of the Orbital 
Acceleration Research Experiment (OARE) currently used by NASA to mea- 
sure the space shuttles on-orbit linear acceleration within the sub-microgravity 
regime. The resolution of the sensor is better than 10 nanog with 25 s integra- 
tion and its bias and scale factor are calibrated in orbit by rotating the MESA 
sensor mounted on a dual-gimbals platform [19]. The European QSAM (Quasi- 
Steady Acceleration Measurement) instrument follows the same approach with 
two pairs of QA 3000 pendulum accelerometers, each pair being mounted on a 
flipping device for calibration and associated with a static triad of accelerome- 
ters. Due to the use of QA3000, the resolution is of the order of 5 x 10“^ g [20]. 
The three instruments, QSAM, OARE and ASTRE have flown in 1996 during 
the same shuttle mission MSL-1. 

Superconducting accelerometers, with cryogenic magnetic levitation instead 
of room temperature electrostatic one, have been developed for ultra-sensitive 
gravity gradiometry [21]: the cylindrical mass being maintained in radial direc- 
tion with a blade, highly stable magnetic stiffness are induced between the mass 
and the instrument cage and along the axial direction by superconducting loops, 
associated with squid position sensing. Same type of instrument configuration 
are also being defined for three axis measurement but without any mechanical 
stiffness and with a three axis magnetic suspension [13]. 

3 Electrostatic Servo— Controlled Accelerometer 
Operation 

The principle of operation of a servo-controlled electrostatic accelerometer is 
based on the measurement of the electrostatic force necessary to maintain the 
accelerometer proof-mass motionless with respect to the sensor cage. For space 
applications, the proof-mass can be fully suspended in the three directions, 
suppressing any mechanical contact to the benefit of the resolution and yielding 
to a three axis accelerometer. 

The performance relies on the resolution of the proof-mass position sensor, 
the very limited stiffness that links the test-mass to the accelerometer cage 
and the very weak level of the test-mass motion disturbances, F^th due to the 
environment (magnetic, electric, thermal), and independent to the electrostatic 
suspension. Let us consider the expression of the test-mass motion Xmass inside 
the sensor cage: 

5 ^mass ~ 9 F Telec A (^mass ^cage) and S Xcage — Tgpcft (1) 

with s the Laplace derivative variable, Teiec the acceleration from the electro- 
static suspension, g the gravity held and Wp the angular frequency associated to 
the parasitic uncontrolled stiffness. Then the measurement of Teiec provided by 
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the accelerometer leads to the acceleration of the spacecraft 



d^elec — 



G 



G 



Loop 



Loop 

+ ± 



^Apcft — Atb + (s^ ± COp) , 



(2) 



with Gloop the gain of the servo-loop which controls the proof-mass degree 
of freedom Xmass, Gloop ^ 1 at low frequencies and with Xnoise, the capacitive 
sensor noise expressed in displacement according to the geometry of the config- 
uration electrodes-mass. 

One of the advantages of the electrostatic suspension is the use of very weak 
acceleration with measurable level of applied voltages: electrostatic actuators 
with sensitivity lower than one nano-g per Volt can be achieved. Furthermore, 
when the position of the mass is servo-controlled, the difference a;mass — a^cage is 
almost null in the frequency bandwidth of the suspension where the gain of the 
servo-loop is important. Then, the optimisation of the sensor-head configuration 
which aims at reducing /^tb and at producing well identified Tgiec is made easier 
by the steady geometrical configuration. 

Because the electrostatic forces remain normal to the faces of the mass, as- 
sumed to be a perfect conductor (then an electrical equipotential), a cubic proof- 
mass with parallelism and perpendicularism deviations less than one arc second 
ensures very low coupling between the three pairs of servo-control channels that 
keep it motionless in position and attitude. And thus low coupling of the sensitive 
axis measurements are obtained. 

The pair of electrodes corresponding to each loop can be used for both capac- 
itive position sensing and electrostatic restoring force generation (Fig. 2). From 
the measurement of the capacitive sensor, a corrector determines the opposite 
drive voltages to be applied on the opposite electrodes for the generation of 
the electrostatic field. A unique sine wave pumping signal Vd is applied to the 
proof-mass for the six capacitive sensors. A biasing reference voltage can be 
also applied to the proof-mass or to both electrodes in order to make linear the 
electrostatic actuators. Both electrodes attract the proof-mass with forces Fi 
and F 2 proportional to the gradient of the capacitance and to the square of the 
electric potential differences between the proof-mass and the electrodes. The 
resulting force F (F = Fi + F 2 ) is expressed by: 

F=^ (vG2 (V 2 - Vpf + VGi (Vi - Vpf) = mFeiec (3) 



In case of a perfect configuration symmetry with: 



Z\G2 = -AGi = AC Vi = -V 2 = V (4) 



the resultant force F is proportional to V: 

Felec= -F = 2VCVpV (5) 

m 

The loop provides a cold damping of the proof-mass motion and a strong 
control at frequencies lower than the closed loop natural frequency, /o^. The 
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= -y + Vp 



Fig. 2. Scheme of one servo-loop channel. 



output network provides the measurement of the actual applied voltage V which 
constitutes the analogue output of the accelerometer. ff(s) being the linear re- 
lation provided by the electronics between V and the capacitive sensor output, 
the expression of the measure is: 



Measure = Feiec = 



H{s) 

H{s) -I- ± 



(-^spcft H “h ib ^noise) 



(6) 



and the expression of the mass displacement: 



— ^cage 



H{s) 

H{s) -b ± 



H{s) -b ± w' 



“ (-f^spcft -^dtb 1?) • 



(7) 



The electrostatic suspension bandwidth can be selected much larger than the 
measurement frequency domain without affecting the accelerometer resolution. 
Then, the accelerometer frequency response is flat (first term of the expression 
of the measure is almost one) and the proof-mass motion is very limited to the 
benefit of the accelerometer linearity and stability of characteristics. 



4 The ASTRE and STAR Accelerometers 

On the basis of the previously described concept, several configurations of the 
sensor-head with its associated electronics have been developed according to the 
mission requirements. 

ASTRE (French acronyms of Accelerometre Spatial TRiaxial Electrosta- 
tique) has been optimised for the monitoring of the manned spacecraft envi- 
ronment and its configuration is well suited to realise a three axis accelerometer 
with the possibility to perform ground tests under normal gravity [22]. 
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Fig. 3. ASTRE-STAR sensor-head. 



The mechanical core of the sensor is defined with a silica core surrounding a 
72 grams parallelepiped proof-mass made of titanium alloy, 4 cm side and 1 cm 
height (see Fig. 3). The core is internally gold coated and the electrodes face the 
proof-mass by pair. The mass is ground with parallelism and perpendicularly 
deviations less than 10“® rad, that ensures the high uncoupling between the three 
axes. The asymmetry of the configuration (not a cubic proof-mass) is required to 
perform the electrostatic suspension of the mass under one g, the electrode areas 
being larger along the vertical axis and the distance between the proof-mass and 
the electrodes being reduced to a few tens of microns. An electrostatic field as 
strong as 3 X 10^ V/m can be necessary to sustain the proof-mass. The core 
is integrated inside a tight housing ensuring a secondary vacuum of 10“® mbar 
and a magnetic shield to the benefit of the mass motion disturbances. 

According to the mission, the ASTRE configuration can be slightly modified: 
the range of the accelerometer can be settled by changing the proof-mass mate- 
rial (the proof-mass density can vary from 2.2 g/cm^ for Silica up to 20 g/cm^ 
for Platinum) without modifying the silica core; the proof-mass sizes can also be 
modified and thus the gaps between the electrodes and the proof-mass can be 
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adjusted between 30 and 1 mm; the strength of the electrostatic actuators 
being reduced, the accelerometer sensitivity can be increased. 

The analogue servo-loop electronics are designed around six capacitive sen- 
sors associated to six electrostatic actuators. The sensitivity of the capacitive 
position sensing must be settled to the electrode configuration and sizes. The 
areas of the two z electrodes are for instance 2 cm^ and the distance to the mass 
is 75 fim leading to capacitance of 24 pF. With a sensitivity of 0.3 V//rm, the 
measured resolution of the position sensing is 2 x -I- 0.4//)^/^ m /VHz as 

shown in Fig. 4 and with a negligible back-action of the electrical sensing signals 
on the mass acceleration. 
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Fig. 4. Capacitive position sensing resolution with ASTRE accelerometer configuration 
and electronics; -120 dB equals 1 /iV sensor output i.e. 3.3 picometre 



In spite of the differences of the geometry, the characteristics of the three 
outputs of the ASTRE accelerometer are very similar with a resolution of 1 nano- 
g and a full scale range of one milli-g.The ASTRE accelerometer flew three times 
on board the Columbia shuttle in 96 and 97 for the missions STS-78, STS-81 
and STS-81R. Integrated inside its host system, the Microgravity Measurement 
Assembly (MMA), ASTRE collected data during the whole missions. Fig. 5 shows 
the statistic distribution of the measured acceleration during the third mission, 
demonstrating that the microgravity level is unfortunately not reached during a 
wide window [23]. 

The Fig. 6 illustrates the impact on the Spacelab acceleration in translation 
of the actuation of the Columbia shuttle thrusters during the attitude control 
phase of the shuttle. It is to be noticed that the average level is very close to 
zero demonstrating the weak instrument bias. 

From the ASTRE experience, the STAR accelerometer has been defined for 
the German CHAMP mission (CHAllenging Micro-satellite Payload for geophys- 
ical research and application) dedicated to Earth’s observation. Five scientific 
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MSL-1R : ASTRE along Zast axis (XB axis) 




GMT day 



Fig. 5. Distribution of time percentage per mission day when the measured acceleration 
is lower than 10 ^g, between 10 and 100 /rg, 100 fig and 1000 fig or larger 



instruments constitute the satellite payload: two magnetometers for the global 
and accurate Earth’s magnetic field mapping, one digital ion drift meter to de- 
liver complementary measurements of the Earth’s electrical field, one GPS dual- 
frequency receiver contributing to the atmospheric analysis, density, pressure 
and temperature sounding but devoted first to the precise orbit determination 
with centimetre accuracy and one STAR (Space Three-axis Accelerometer for 
Research) accelerometer [24], [25]. 
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Fig. 6. Measured Spacelab acceleration during the throlling of the shuttle attitude 
control thrusters 
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Fig. 7. STAR sensor unit before integration in the CHAMP satellite 



The CHAMP satellite is scheduled to be launched in April 2000 by a Russian 
Kosmos rocket at an altitude of 550 km in a circular orbit with a 100 degree 
inclination, the eccentricity being less than 0.01. During the five years of the 
mission, the orbitography will be finely performed and the accelerometer, inte- 
grated at the centre of mass of the satellite with an accuracy of a few millimetres 
shall measure in a frequency bandwidth from DC to a few tenth of Hertz the 
air drag, the solar and Earth radiation pressures and the attitude manoeuvre 
effects aiming at the Earth’s gravity field up to degrees and orders of about 50. A 
by-product of the accelerometer measurements will be the determination of the 
atmospheric density variations, in particular at the lower altitude of the mission. 

STAR has an internal core configuration similar to ASTRE. This six-axis 
accelerometer provides the three linear accelerations along the instrument sen- 
sitive axes and the three angular accelerations about these axes. STAR presents 
a measurement range of 10“^ m s“^ and exhibits a resolution of better than 
3 X 10“® m s“^ for the y- and z-axes and 3 x 10“® m s“^ for the x-axis within 
the measurement bandwidth from 10“'* Hz to 10“* Hz. The measurements are 
integrated over one second before delivery to the satellite data bus. In orbit, 
STAR a:-axis is vertical from the Earth, while the y-axis is normal to the orbit 
and the z-axis is along track. The Fig. 7 shows STAR stand-alone instrument 
constituted by a sensor core depicted previously and integrated inside a tight 
housing with servo-loop electronics boxes around, all packaged in the same unit. 
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Fig. 8. STAR resolution as demonstrated with ground tests (left) and free fall tests 
in drop tower (right); the accelerometer is measuring in both case the residual level of 
vibrations of the testing bench maintained horizontal or of the falling capsule 



Its total mass is 7.8 kg, inside a volume ofl9.5xl8.5xl6.5 cm^ and its nominal 
power consumption is 1.7 W. 

The STAR performance has been verified in laboratory as far as it is possi- 
ble in presence of seismic vibrations and with a mass levitation under one g by 
specific booster electronics. The y and z accelerometer axes being maintained 
horizontal, measurements of residual low frequency vibrations of the testing plat- 
form have been performed with a resolution of 10“® m s“^/\/Hz at frequencies 
lower than 0.1 Hz corresponding to the requirement of 3 x 10“® m s“^ rms in 
the measurement bandwidth (see Fig. 8). The accelerometer low level of bias (less 
than 10“® m s“^) has been also verified in free fall in the Bremen drop tower 
[26]. 
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Fig. 9. Diamond configuration gradiometer. 



5 From CHAMP to GRACE and GOCE Instrument 

With an up-date of the STAR configuration by modifying the core geometry, the 
position sensing and the electrostatic actuator sensitivities, the SuperSTAR ac- 
celerometer has been defined for the Gravity Recovery And Climate Experiment 
(GRACE) mission [27]. This instrument presents two modes of operation with 
two different ranges: the first one is more robust ensuring the mass levitation 
with a strong control, the second one corresponds to a full measurement range 
of 5 X 10“® m s“^ with a resolution of 10“^° m s“^/-\/Hz. Two flight models 
of SuperSTAR accelerometer are being produced in order to be integrated in 
the two GRACE satellites in the first half of 2000. The mission aims at the 
recovery of the Earth’s geoid with an accuracy of 3.5 mm and a geographical 
resolution of 200 km. The satellite to satellite tracking performed with a micro- 
wave link at altitudes between 500 km and 300 km provides the measurement of 
their relative velocity with an accuracy better than 1 /rm s“^ depending on the 
over flight gravity anomalies and on the satellite surface forces measured by the 
accelerometers [28]. 

As a complement to the previous JPL mission, the Gravity and Ocean Cir- 
culation Explorer (GOCE) mission has been selected in 1999 by the European 
space agency in the frame of the Earth explorer program [29], [30]. On board 
a drag-free satellite, at an altitude as low as 250 km, the gravity gradiometer 
shall provide the accurate measurements of the Earth’s gravity gradient tensor 
components for the determination of the upper harmonics of the potential [31]. 
This gradiometer is composed of six identical electrostatic accelerometers in a 
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diamond configuration and accommodated on a rigid and stable Carbon-Carbon 
structure inside a double chamber thermal enclosure. The theoretical models of 
the accelerometer operation at room temperature, which are based on the ex- 
perimental results obtained with the STAR and SuperSTAR qualifications, have 
been exploited to demonstrate the adequacy of such an instrument for the con- 
ditions of the very soft environment of GOCE satellite. With maximum levels 
of acceleration less than 10“® m s“^, due for a large part to the Earth gravity 
gradient itself, and temperature variations lower than 10 mK over the orbit, the 
expected resolution of the accelerometers is 4 x 10“^^ m s“^/-\/Hz in the fre- 
quency bandwidth from 5 x 10“^ Hz to 0.1 Hz. Taking into account the 10“® 
rejection factor of the residual satellite acceleration in translation and attitude, 
the gradiometer resolution is evaluated to better than^ 5 mE [32]. 

6 Electrostatic Accelerometers to Test the Equivalence 
Principle in Space 

The objective of the /rSCOPE space mission that we presently propose to CNES 
(French national Agency) is to improve by a factor one thousand the test of 
universality of free fall leading to an accuracy of 10“^® [33]. The laboratory 
experiments for the Equivalence Principle (EP) test exploit torsion pendulum 
and have to deal with the environmental instabilities and in particular the Earth 
gravity gradient fluctuations [34] . Recent results have been obtained by consider- 
ing the Moon-Earth laser ranging data but the material composition of the two 
bodies is not sufficiently well known [35]. Space experiment takes advantage of 
the in orbit soft conditions and the possibilities of long term signal integration. 
The usual ‘Galileo type’ experiment consists of comparing the relative motion of 
two free falling test masses made of different materials. Another way to perform 
the EP test is to control at null the relative mass motion (the masses having a 
free common fall motion in orbit) so that any EP violation appears through the 
measured forces necessary to nullify this relative motion. Then a much better 
management of the environment disturbances is possible because of the steady 
configuration. This is why we propose to perform this experiment on board a 
drag-free satellite with two servo-controlled electrostatic accelerometers, one 
implemented inside the other one [36]. Thus, the two test-masses are centred 
and submitted to the analog Earth and satellite gravity field: the ratios Mg/Mj, 
gravitational mass Mg over the inertial mass M,, of the test-masses made of 
different materials will be compared . The /iSCOPE payload is in fact composed 
of two of these differential accelerometers operating at room temperature: two 
pairs of test masses are then compared, one with the same material, the second 
with two different materials leading to a double differential measurement to the 
benefit of the rejection of the common motion disturbances. The satellite spin 
about the perpendicular axis to the orbit varies the orientation of the Earth 
gravity field in the instrument reference frame leading to a research of the viola- 
tion signal at a well known frequency and helps to the discrimination from other 
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effects like thermal or gravity gradient ones. With a signal integration over 20 
orbits (10^ seconds), the difference of the two test-mass accelerations shall be 
measured with a resolution of 10“^^ m s“^/\/Hz in the vicinity of the sum of 
the orbital and spinning frequencies, about 10“^ Hz [37]. 

The expected matching of the characteristics leads to a 10“^ rejection of the 
accelerations applied in common mode on the two concentric accelerometers and 
the necessary drag compensation system of the satellite must then reduce these 
accelerations to levels of less than 10“® m s“^/\/Hz. 

The cylindrical test masses present diagonal inertia matrices to limit the 
effects of the satellite and Earth gravity gradient and the common revolution 
symmetry axis is the EP test sensitive axis. The electrode configuration presented 
in Fig. 10 allows the measurement and the control of the 6 degrees of freedom 
of the two test-masses. While the eight quadrant electrodes associated by pairs 
allow the control of the radial translations and rotations, the two cylindrical 
sensing electrodes at the ends of the test-masses are used to control motionless 
the test-masses along the axial direction quite independently to the mass relative 
position that is adjusted in orbit to a few microns. 

The 10“^^ m s^^/v^Hz expected resolution of this instrument is deduced 
from the three major noise sources. At 
lower frequencies, the thermal instabil- 
ities induces radiation pressure and ra- 
diometer acceleration fluctuations due 
to residual gas for the latter. At higher 
frequencies, the position sensing res- 
olution affects the resolution with a 
square frequency law. Between, the 
thermal noise of the mass motion is 
derived from the damping factor esti- 
mated from dedicated laboratory ex- 
periments and mainly due to the thin 
5 /rm wire used for the charge control 
of the mass [38], [39]. 

7 Space Gravity Wave Antenna 

The LISA space mission (Laser Interferometer Space Antenna), under study by 
US and European Space Agencies, aims at the observation of the low-frequency 
astrophysical gravitational radiation with cosmological and fundamental physics 
objectives [12]. The interferometer is realised with a triangle formation of three 
drag-free spacecraft in heliocentric orbit. At the centre of the satellite, the proof- 
masses of inertial sensors are the mirrors at each end of the 5 x 10® km interferom- 
eter arms. The optical bench of each satellite is highly stabilised in temperature 
(10“® K/-\/Hz) and accommodated around the inertial sensors, the YAG laser 
beam being reflected directly on one side of masses. Then, the proof-masses 
constitute the inertial references, free of any acceleration disturbances, with an 




Fig. 10. Differential electrostatic ac- 
celerometer for the in orbit test of the 
Equivalence Principle. 
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expected level of a few 10“^® m s“^/-\/Hz in the very low frequency domain 
from 10““^ Hz up to 10“^ Hz . Furthermore, the inertial sensors are used by the 
drag-free satellite subsystem for the compensation of the drag due to radiation 
pressures. So, the satellite carries the optical bench and is a shield for the mass 
that the satellite follows accurately, at 10“® m/\/Hz, in the same low frequency 
bandwidth, in order to limit the mass disturbances, satellite gravity gradient for 
instance. 

As in the case of /rSCOPE configuration, this sensor presents a preferential 
axis (along the laser beam) and is optimised for a very weak range of operation 
thanks to the drag-free satellite. The accelerometer is built around a 1.5 kg 
bar test-mass made of Gold-Platinum alloy leading to a very weak magnetic 
susceptibility. Its cage is made of ultra low expansion (ULE) silica as well as 
its reference plate fixed on the optical bench taking advantage of the quite null 
coefficient of thermal expansion at the 25° C operating temperature [40]. 

The instrument housing is opened to the space vacuum in order to obtain a 
residual internal pressure less than 10“® Pa. The principle of operation of the 
sensor is similar to the previous ones except that the satellite drag compensa- 
tion system acts the electrical satellite thrusters in order to nullify the voltages 
that should be applied on the electrodes to keep the proof-mass motionless. In 
this way, no electrostatic forces will be applied to the proof-mass within the 
frequency bandwidth of the satellite control loop, larger than the interferometer 
measurement bandwidth. So, the inertial sensor has not to deliver an accurate 
measurement of acceleration with well known and steady scale factor but the 
configuration is focussed on the reduction of all parasitic forces. No discharging 
wire is envisaged to control the proof-mass potential for such a room tempera- 
ture sensitive sensor. So an active control of the mass charge is necessary. The 
Fig. II presents the silica core of the first laboratory model that has been in- 




Fig. 11. Inertial sensor for gravity wave space interferometer: first laboratory model 
core with a parallelepipedic configuration. 
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Electrostatic accelerometers developed at ONERA 
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Fig. 12. Electrostatic space accelerometer range and performance. 



tegrated with a silica mass in order to perform laboratory test under normal 
gravity: the levitation of the mass is performed and the electrostatic configu- 
ration is under test in order to demonstrate its compatibility with very weak 
induced stiffness, lower than 10“® N/m. 

8 Perspective 

One of the major advantages of space environment is the low level of acceler- 
ation disturbances. On one hand, the survey of these levels require specific in- 
strument, on the other hand, the dedicated space accelerometers take advantage 
of the specific range of operation and measurement. The concept of electrostatic 
servo-controlled accelerometer is well suited for these space applications: the 
electrostatic forces give the possibility to generate very weak but accurate ac- 
celerations while the capacitive sensing offers a high position resolution with 
negligible back reaction. 

The Fig. 12 summarises the different models that are under development or 
have been already produced for space missions. The large flexibility of config- 
uration and operation allow many mechanical sensor configurations. Moreover, 
the present development of digital servo-loop electronics will permit to cali- 
brate in orbit these instruments and to adjust the parameters of their operation 
[39]. This is now mandatory: the outstanding performance, needed by the future 
space mission s and expected because of the drag compensation system of the 
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satellite, are too far from the test accuracy in the laboratory, limited by human 
activities and Earth gravity field. The room temperature operation facilitates 
the integration on board the satellite. Nevertheless, cryogenic temperatures may 
be necessary to improve further the accelerometer resolution. This is why the 
operation of such an instrument at Helium temperature has been successively 
performed and prepare future physics investigations. 
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Abstract. The appearance of new fundamental forces and extra-dimensional modi- 
fications to gravity in extensions of the Standard Model has motivated considerable 
interest in testing Newtonian gravity at short distances (< 10“® m). Presently a num- 
ber of new gravity experiments are searching for non-Newtonian effects in the ranges 
~ 10“‘^-10“® m. However, as challenging as these experiments are, formidable new 
obstacles await the next generation of experiments which will probe gravity at dis- 
tances < 10~^ m where Casimir/van der Waals forces become dominant. Here we will 
review the motivation for conducting such very short distance gravity experiments, 
and discuss some of the new problems that may arise in future experiments. Finally, 
we suggest schematic designs for null experiments which would address some of these 
problems using the “iso-electronic” and “finite-size” effects. 



1 Introduction 

When Isaac Newton formulated his law of universal gravity over 300 years ago, 
he provided the first mathematical description of one of the fundamental forces 
of nature. Yet, physicists have realized only relatively recently that tests of New- 
tonian gravity can still provide a unique window into new physics [1-7]. Within 
the past 20 years, experimentalists have put Newtonian gravity to the test for 
distance scales 10“^-10^® m by searching for violations of the weak equivalence 
principle (WEP) and inverse square law (ISL). The fact that no such violations 
have been observed places stringent constraints on extensions of the Standard 
Model that would naturally lead to such effects [1]. Despite this effort, a number 
of authors have pointed out that very little is known of the validity of Newtonian 
gravity at distances Y 10“^ m [8-12]. Several experimental groups are currently 
attempting to extend these limits down to lO”"* m [9,13,14], which is near the 
point where Casimir/van der Waals forces overcome gravity to become the dom- 
inant force between neutral, non-magnetic bodies. This strong intermolecular 
force background will become a major challenge for experimentalists who at- 
tempt to probe gravity at much smaller distances. The purpose of this paper is 
to show that such experiments are worth the effort despite the new difficulties, 
and to suggest ideas which may be useful in detecting new forces of gravitational 
strength against a strong intermolecular force background. 

We begin by examining the theoretical motivation for studying Newtonian 
gravity and the phenomenology used to characterize non-Newtonian effects which 
would be the signal of new physics in a gravity experiment. After briefly review- 
ing the current constraints on new forces achieved by longer distance gravity 
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experiments (> 10“^ m), we will see that new problems arise when one at- 
tempts to set comparable limits in the Casimir/van der Waals regime (< 10“^ m) 
where intermolecular forces become large. We then investigate these problems 
quantitatively by computing the forces between two parallel plates over a range 
of separations. Finally, we propose two schematic designs for null experiments 
designed to subtract out the unwanted intermolecular and gravitational back- 
grounds using the “iso-electronic” and “finite-size” effects. These will hopefully 
allow one to search for signals of new physics at very short distances in the 
presence of Casimir/van der Waals forces. 

2 Theoretical Motivation and Phenomenology 

2.1 Overview 

The Standard Model currently provides an adequate description of the electro- 
magnetic, weak, and strong interactions within the framework of quantum field 
theory. However, a consistent description of quantum gravity has yet to be for- 
mulated despite intense work over the past fifty years. The lack of a quantum 
theory of gravity currently provides much of the motivation for studying exten- 
sions of the Standard Model which would bring all the fundamental forces into 
the quantum realm. In fact, many believe that the present Standard Model is 
really only an effective theory which would be superseded at much higher ener- 
gies by a more fundamental theory, such as string or M-theory [15,16]. One of 
the main problems with these more fundamental theories is that, despite their 
purported mathematical beauty, many of their principal consequences lay far be- 
yond the reach of most foreseeable experiments. It is therefore vital to investigate 
the low-energy limits of these fundamental theories to allow experimentalists the 
opportunity to constrain the proliferation of models which would otherwise go 
unchecked. 

It is against this backdrop that one should view recent and future experiments 
testing Newtonian gravity. Many extensions of the Standard Model, including 
string theory, contain new light bosons which would manifest themselves as new 
fundamental forces[l,6,7,12j. These new forces would compete with the other 
known forces, but they would most likely be revealed in a gravity experiment 
for several reasons. First, in many ways gravity remains the least understood 
of the fundamental forces and is relatively untested over a wide range of dis- 
tance scales. Second, any new forces probably couple very weakly with matter — 
otherwise they would have been seen already. Since gravity is by far the weakest 
fundamental force, it sets a natural scale from which to measure new weak forces. 
Third, there are two signatures of gravity which help one extract a signal from 
the background of other forces: 1) Since the gravitational force couples to mass, 
it obeys the weak equivalence principle (WEP), so violations of the WEP would 
indicate the presence of a non-gravitational force. 2) The Newtonian gravita- 
tional force between point particles obeys an inverse square law (ISL), hence 
any departures from the ISL might be attributed to new forces. Finally, New- 
tonian gravity is the weak-field, non-relativistic limit of General Relativity, a 
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theory in which gravity is seen as a manifestation of spacetime that has been 
curved by mass-energy. Therefore, any theory that impacts our understanding 
of space and time must involve gravity. This is important because string the- 
ory requires that there exist more than three spatial dimensions, the new extra 
dimensions being rendered invisible to current experiments by some yet-to-be- 
understood mechanism. As will be discussed below, recent models suggest that 
these new dimensions may modify Newtonian gravity at short, but macroscopic, 
distances. 



2.2 Yukawa Potentials 



The form of the violations of Newton’s law of gravity arising from new physics 
will be to some extent model dependent, but one finds in practice that most 
theories yield modifications that have similar generic features [1]. For example, 
suppose there exists a new vector field A^^{x) which couples to fermions via the 
Lagrangian density 

C{x) (1) 

Here / is the dimensionless vector-fermion coupling constant (h = c = 1) and 
ipix) is the fermion field operator. If two fermions 1 and 2 exchange a single 
vector boson with mass m via this coupling, the lowest order interaction in the 
non-relativistic limit yields a Yukawa potential 



K(r) = ± 



/1/2 

47t r ’ 



(2) 



where “-I-” indicates that the force is repulsive (attractive) between like 

charges. If this was electromagnetism, massless photons give m = 0, and for 
electrons / = — e. In the units we use, the range of the interaction is A = I/m, 
so that m ^ 10“^ eV gives A ^ I cm, for example. If the exchanged bosons 
were scalars instead of vectors, one arrives at an attractive Yukawa potential 
between identical fermions: 



Vs{r) 



/1/2 

47t r ’ 



( 3 ) 



where fi is now the scalar coupling constant. 

If the fermions have masses mi and m 2 , the total interaction potential in- 
cluding gravity and scalar/ vector interactions can be written in the general form 

= + (4) 



where 



ai2 = T 



/1/2 

47rGmim2 



( 5 ) 



The dimensionless constant 012 then characterizes the strength of the interaction 
relative to gravity, and its sign depends on type of boson exchanged. When 
r <C A, |o;i 2 | = 1 indicates a force of gravitational strength. 
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Yukawa potentials also arise naturally in models where new gravitational 
forces appear from extra spatial dimensions. String theory requires there to be 
more than 3 spatial dimensions, but until recently it was thought that all the 
extra dimensions were compactified on the Planck scale and thus invisible to 
any conceivable experiment. However, much attention recently has focused on 
a number of string-inspired models in which all Standard Model particles are 
confined to the usual 3 spatial dimensions (a 3-brane) while gravity can “see” all 
dimensions[17-21]. In such models, one would never expect to see the effects of 
extra dimensions in Standard Model physics, but their effects would appear in 
gravitational physics. Since so much of the parameter space of gravity remains 
unexplored, these effects could have easily escaped detection. For example, in 
models in which the extra dimensions are compact, it is possible that the com- 
pactification radius Tc could be as large as 10“^ m and thus would have not been 
seen in any experiment to date [17,18]. These models would produce dramatic 
deviations from Newtonian gravity at short distances since they imply that the 
r-dependence of the gravitational potential between point masses changes when 
the particle separation approaches Tc- 



{ Ga mi m2 ^ 

for r ^ rc 

( 6 ) 

G4+„mim2 

^ ^ 



Here n is the number of extra spatial dimensions, G4 = G is the usual macro- 
scopic Newtonian gravitational constant, and G4+„ is the more fundamental 
gravitational constant for the total A + n dimensional spacetime. Thus, in this 
model, Newton’s law of gravity is merely a projection of a more fundamental law 
of gravity onto 3 spatial dimensions, and the unusual weakness of gravity rela- 
tive to the other fundamental forces is attributed to this projection. One of the 
striking features of some recent string models is that compactification can occur 
over scales much larger than the Planck scale (1/Mpianck ~ 10“^® m) [17, 18,21]. 
For example, if in a 4 -|- n dimensional spacetime the fundamental mass scale 
fWfund ^ Mew, where Mew ^ 1 TeV is the 
the compactification scale Tc to be given by 

^ / Mpianck V^" 

A7fund V -^fund / 

which yields, 

( 1Q13 m, 

Cc ~ < 10“^ m, 

\ 10-® m. 



electroweak scale, then one expects 
[17,18,21] 

(10-19 m)(10i9)9/”, (7) 



n = 1, 

n = 2, (8) 

n = 3. 



Since no deviations from Newtonian gravity have been observed for r > 10“^ m, 
theories which suggest that Tc ^ 10“^ m (e.g., n > 2) are compatible with 
current experimental limits. As one tests gravity over smaller distance scales, 
the effects of new extra dimensions would first appear as corrections to the usual 
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gravitational potential. It has been shown that these corrections for r ^ Tc have 
a Yukawa form [18,22,23]: 

I^grav(r) - . (9) 

Here a„ is a composition-independent universal constant that depends on n and 
the nature of the compactification, and the range of the interaction is A ~ Tc, 
the compactification scale. For example, for n extra dimensions compactified on 
an n-torus, = 2n [18,22,23]. It is thus conceivable that the first evidence 
supporting the existence of extra spatial dimensions (and string theory) could 
come from the detection of a composition-independent Yukawa modification of 
Newton’s law of gravity. 



2.3 Current Constraints on New Yukawa Forces 



Let us now turn to the current laboratory constraints on new Yukawa forces 
which arise from a generic potential of the form, 



Vyir) = —a 



Gmim2 

r 



e 



— r/X 



(10) 



(Here we assume that the interactions are attractive for positive a.) This po- 
tential will lead to a violation of the WEP in a gravity experiment if a is 
composition-dependent, as in the case of a vector or scalar interaction. In addi- 
tion, even if a is independent of the composition of the test masses, as in the 
case of the extra dimension theories described earlier, small violations of the 
WEP will still be present in an experiment using different materials due to the 
“finite-size” effect [1]. This effect arises because a non-uniform Yukawa field will 
“capture” different fractions of two finite-sized objects having the same mass, 
but different densities, as will be the case in the null experiments considered 
below. Finally, in addition to violating the WEP, Vyir) will also violate the ISL, 
and so constraints on a can be inferred from tests of the gravitational ISL. 

As shown in Fig. I, the current experimental constraints on the Yukawa cou- 
pling constant a as a function of range A are quite stringent (allowed |a| <C 1) 
for 10“^ m ^ ^ ^ 10^^ m, but they fall exponentially outside this region. 
Composition-dependent experiments have set strong limits for specific couplings 
(e.g., to baryon number) when A > 10^® m [1], but also fall off exponentially for 
A < 10-3 m. 

As discussed in more detail in Ref. [9], current constraints allow a > 1 for 
A < IQ-3 m. For 10“^ m ^ ^ IQ-^ m, these limits were obtained from a test of 

the gravitational ISL by Mitrofanov and Ponomareva [24], but they still permit 
a new force with ot ^ 10^ for A ~ 10“^ m. However, a new round of gravity 
experiments [9,13,14] should fill in much of this region of parameter space within 
the next few years. At shorter distances, Casimir/van der Waals forces dominate 
gravity so the current limits are set by Casimir force experiments [9,25-27] and 
are much less restrictive than those obtained from the longer ranged gravity ex- 
periments. [See also Refs. [28,29] for detailed discussions on extracting limits on 
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Fig. 1. 2(7 constraints on the coupling constant a as a function of the range A from 
composition-independent experiments [1], The dark shaded area indicates the region 
excluded as of 1981, and the light hatched region gives the 1996 limits which remain 
current. 



new forces from Casimir force experiments.] The region A < 10“^ m will remain 
essentially unexplored until the next generation of experiments specifically ded- 
icated to search for new forces is designed and carried out. We turn next to a 
discussion of some of the difficulties likely to be encountered in developing this 
next round of experiments. 

3 Problems in Testing Gravity at Very Short Distances 

3.1 General Problems 

As noted in the Introduction, a number of authors [8-12] have called attention to 
the huge gap in our understanding of gravity at very short distances, and to its 
potential to reveal new physics. The fact that short-distance gravity experiments 
can potentially expose the presence of extra spatial dimensions is particularly 
tantalizing. However, since jaj 1 in these string models, the ultimate experi- 
mental goal is quite challenging, namely to set limits jaj < 1 for A < 10^^ m. 
To accomplish this, one has to be able to sense and distinguish a force of grav- 
itational strength at these distance scales. Since the current laboratory limits 
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in this region are orders of magnitude less sensitive than this goal, we discuss 
briefly some of the difficulties in studying gravity at short distances. 

An obvious problem is scaling [9]. Suppose we have two identical spheres of 
density p and radius R. If we wish to test the law of gravity between these spheres 
at short distances, the dimensions of the spheres have to be made comparable to 
the small distance scales that we are interested in probing. Since the minimum 
separation distance between centers is 2R, this leads to a maximum force 



F = 

m n nf' 



GiinpR^/Sy 

(2i?)2 



(X 



( 11 ) 



This example illustrates that the gravitational force between macroscopic ob- 
jects, which is already quite small, decreases rapidly with size and separation of 
the test masses. 

A second problem in searching for new short-ranged forces is that their short 
range limits the effective mass of a body that can participate in the interaction. 
Suppose we have a sphere of uniform density and radius R. Since gravity is a 
long-range force, another identical sphere close by will interact with all of the 
mass of the first sphere. However, if there exists a new force of range A <C i?, 
then only the layer of material of thickness ~ A on the surface of the sphere 
will interact with external objects, and hence only a fraction ^ A/i? of the total 
mass participates in interactions. But this problem is actually much worse in 
general. If we have two spheres nearly touching, it is only the mass within a 
range A of the contact point that interacts, which is much less than the fraction 
X/R, while the gravitational force is still felt by all the mass. Therefore, even 
if the new force is intrinsically of gravitational strength (a = 1) between point 
masses when r <C A, this new force between macroscopic bodies will usually be 
much smaller than the corresponding gravitational force. This situation was not 
encountered in previous longer range gravity experiments since in those cases, 
X> L, where L was the characteristic size of the test bodies used. 

A third problem occurs when the separation distance is ;< 10“"^ m, where 
intermolecular forces become significant. Since these forces have a power-law 
form 1 /r" [30] , where n depends on the geometry of the macroscopic bodies, they 
grow very rapidly as r decreases and overwhelm gravity at very short distances. 
Distinguishing a force of gravitational strength from this background will be a 
major challenge. 



3.2 Quantitative Example: Parallel Plate Gravity Experiment 
Idealized Setup 

To better appreciate how these problems might arise in actual experiments, let 
us now consider a simple experimental setup. Our goal here is to estimate the 
size of the various effects which might appear, and not to propose the optimal 
experimental design, and hence we will ignore practical problems which might 
be encountered when one actually attempts to realize such a design. Since we 
are searching for forces of very short range, the discussion of the previous section 
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Fig. 2. The idealized parallel plate 
setup nsed to quantitatively estimate 
the relative magnitudes of the gravi- 
tational, Casimir, and Yukawa forces. 
Numerical results were obtained by let- 
ting L = 1 cm, D = 1 mm, Pcopper = 
8.96 X 10® m®, T = 300 K, and d < L. 



suggests we should have most, if not all, 
of the mass of the two test bodies in an 
experiment contributing in order to real- 
ize the largest possible force. This means 
that we need to have all the mass in one 
body as close as possible to all the mass 
of the second body. The simplest way to 
accomplish this is to use parallel plates as 
our test bodies [29] which maximizes the 
“effective mass” for any short-range range 
force. It then follows that the most ap- 
propriate configuration for searching for 
new forces between macroscopic bodies is 
a parallel-plate experiment analogous to 
those used to study the Casimir effect [31] . 

Let us now consider two identical 
plates of density p, thickness D, area A = 
separated by a distance d (Figure 2). 
If we assume d <C L, we can then safely 
neglect edge effects and calculate the pres- 
sures between the plates as if L = oo. 
In addition, we assume that the plates 
are perfectly smooth, perfectly conduct- 
ing, and at temperature T = 300 K. 



Force Formulas 

We begin our investigation of the forces between these plates with the known 
forces, starting with gravity. For this particular configuration, the gravitational 
force acting on the plates is given by 



FGravity(d) = -2ttG , (12) 

where the minus sign indicates an attractive force. We see that when the plates 
are sufficiently close, the gravitational attraction is constant, independent of the 
separation d. 

If there are no stray charges, etc., gravity is the dominant force at large plate 
separations, but as d decreases, the Casimir force grows rapidly and quickly over- 
whelms gravity. Calculating the Casimir force for this geometry for real metals 
can become quite complicated, involving corrections for finite conductivity and 
surface roughness [32-34]. However, for present purposes we will ignore these dif- 
ficulties by assuming the plates to be smooth and perfectly conducting over all 
frequencies, which should be a good approximation as long as the plates are not 
too close. However, we will include thermal effects which become large when the 
plate separation is large. The Casimir force between our plates at temperature 
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T can be written as [35,36] 



-^Casimir(^) 



7T^ hcL^ nkTL'^ 

TT^{kT)^L‘^ 

45(fi.c)3 



In 

n—1 



1 — exp 




(13) 



where k is Boltzmann’s constant and factors of h and c have been included for 
convenience. Eq. (13) simplifies in the two limiting cases [35,36]: 



Ecasimir id) 



- 1.202 
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\ 47rrf3 J 
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d < 



Trhc 

W' 

Trhc 

W' 



(14) 



Having obtained expressions for the known forces acting between the plates 
in this idealized setup, let us now determine the forces arising from possible new 
interactions. The attractive Yukawa potential between point masses as given by 
Eq. (10) leads to a force between the plates (with d L) given by 



EYukawa(d) = -2tt G (l - (15) 



We then notice that the ratio of this Yukawa force to the gravitational force in 
Eq. (12) is 



TVukawa ( d ) 

^Gravity (d) 








(16) 



If A <C £), then 



EYukawa(d) ^ ^ 
Ecravity(d) \D J 



(17) 



Thus, even if the Yukawa coupling is intrinsically of gravitational strength (a = 
1), the actual Yukawa force is suppressed relative to gravity not only by the 
usual exponential factor but also by (A/H)^ which arises because only a 

fraction X/D of the total mass of each plate contributes to the Yukawa force. 
This effect was discussed earlier and clearly illustrates how a short-ranged force 
intrinsically of gravitational strength is strongly suppressed in an experiment 
using macroscopic bodies. 



Numerical Results 

Having found the general formulas for all the forces that we will be considering, 
let us now obtain numerical values for the following setup. We assume that the 
plates have dimensions L x L x D, where L = 1 cm and D = 1 mm, which 
are roughly comparable to the values used in some of the current short distance 
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Table 1. The magnitndes of the gravitational, Casimir, and Yukawa (using a = I and 
A = 10“® m) forces arising in the idealized parallel plate experiment discussed in the 
text. Here EBack = Ebravity + Ecasimir is the total background force against which the 
signal of EVukawa must be seen. 
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gravity experiments [9,13]. Since our previous calculations assumed d <C we 
focus our attention on the region 10“® m < d < 10“^ m. Next we will assume 
that the plates are made of pure copper which has a density p = 8.96x 10^ kg/m^. 
Except for the new force parameters a and A, our problem is now completely 
specified. 

Using these numbers, we first calculate the known forces, gravity and Casimir, 
for the plates. As discussed earlier, the gravitational force under the conditions 
assumed here is constant and given by Eq. (12). Substituting the parameters 
given above yields 

d^Gravity = 3.37 X 10“^^ Newton. (18) 



To determine the Casimir force for this configuration, we use Eq. (13). The 
cross-over distance dc, where temperature-dependent effects become important 
at T = 300 K, is 



dc — 



ttHc 

W 



= 2.4 X 10 ^ m = 24 pm. 



(19) 



Graphs of the Casimir force using Eq.(14), and the gravitational force between 
the plates, are shown together in Figure 3, and numerical values of these forces 
at various distances can be found in Table 1. The gravitational and Casimir 
forces are equal to each other when T = 300 K at d = 2.3 x 10“^ m = 23 pm, 
which just happens to coincide with dc here. Thus, for d ^ 23 pm, the Casimir 
force will dominate gravity in this setup. 

Now let us turn to new Yukawa forces, which are characterized by two free 
parameters, the relative strength a and the range A. If for illustrative purposes 
we consider a force of gravitational strength (a = 1) and set A = 10“® m, then 
Eq. (15) yields EVukawa (d) exhibited in Figure 3 and Table 1. We see that when 
d <C A, the force becomes constant: 



EVukawa(d <C A) = 3.37 X 10 Newton, (20) 

which is (X/D)^ = 10“^ times smaller than the corresponding gravitational force 
given by Eq. (18). Thus, as explained earlier, even though the Yukawa force 
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d (meters) 



Fig. 3. The Casimir, gravitational, and Yukawa (a = 1, A = 10 ® m) forces between 
the parallel plates shown in Fig. 2. 



between point particles is of gravitational strength at short distances, FVukawa is 
much smaller than gravity for these macroscopic plates. We also clearly see that 
^Yukawa/^Back is maximized when d ^ A and falls off rapidly from this plate 
separation (Fig. 4). This is because FVukawa levels off when d< X while Fcasimir 
continues to increase via a power-law (1/d'^ if ^ <^c)- 



Constraining New Short— Ranged Yukawa Forces 

This analysis using an obviously idealized setup reveals the two critical prob- 
lems that will be encountered in devising experiments using macroscopic bodies 
to search for very short-ranged Yukawa interactions of gravitational strength 
(a ~ 1). The first is that the absolute magnitude of such a force will be very 
small, possibly even smaller than the gravitational force if a is small. Thus, 
an experiment must be sensitive to the smallest possible forces. Second, since 
the Casimir background force grows rapidly as the separation decreases, one 
must be able to extract the signal of a very weak force from a background of 
very strong intermolecular forces. A direct attack on this problem would be 
to attempt to calculate as accurately as possible the background forces in a 
gravity experiment, and to then subtract these from the observed force to set 
limits using what remains [9,26,28,29]. However, recent experiments studying 
the Casimir force reveal the difficulty of accurately calculating the background 
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Fig. 4. The ratio of the Yukawa (a = 1, A = 10 ® m) force to the background force 
Fuack = Fcravity + Fcasimir for the parallel plate setup shown in Fig. 2. 



forces to better than 1 % [25,27,32-34,37,38]. While this approach is still possible, 
we will describe in the next section a new method of performing a null short- 
distance gravity experiment specifically designed to directly subtract out the 
unwanted background effects. Calculating intermolecular forces precisely then 
becomes much less important. 

4 Very Short Distance Null Gravity Experiments 

Some of the best constraints on Yukawa interactions come from tests of the WEP 
[1]. In these experiments, one compares the accelerations of compositionally- 
different test bodies toward a common source body. Any differences in these 
accelerations can then be attributed to non-gravitational forces. We wish to 
utilize the same principle in a short-distance gravity experiment, and thus avail 
ourselves to the extreme sensitivity of such experiments. 

4.1 Null Experiment #1 

Inspired by two ongoing short distance experiments [9,13], a possible design for 
one such experiment is shown in Figure 5. It consists of two parallel plates, 
a source plate 1 and a detector plate made of two smaller plates 2 and 2'. 
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Fig. 5. Schematic design for Very Short Distance Null Experiment #1. See text for 
details. 



The source plate is driven sinusoidally with angular frequency ui such that the 
separation distance d is given by 

d{t) = do + di cos ujt. (21) 

Instead of detecting a force, this experiment would be sensitive to a torque, 
modulated by the frequency uj, about an axis passing along the boundary where 
plates 2 and 2' are joined, as shown in Fig. 5. If we assume that the plates are 
conducting, the net torque r„et on the detector plate will arise from contributions 
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from the gravitational, Casimir, and possibly, Yukawa forces: 



T’net — ^Gravity + t'C; 

L 



Dtukawa 



2 L 






Gravity 

2 



T-iGravity 
~ ^2' 



) + {F, 



Gasimir 






Casimir\ 



I ( TT’Yukawa T7’Yukawa\' 
+ 1-^2 ^ 2 ' ) ^ 



(22) 



where ^^iravity^ ^Casimir^ ^Yukawa gravitational, Casimir, and Yukawa 

forces on plates i = 2 and 2' respectively. One then selects plates 2 and 2' 
such that the torque rcravity + Tcasimir due to background forces vanishes while 
TYukawa 0 if «ii yf 0. [Here we allow for the possibility that «i 2 and ai 2 ', the 
Yukawa couplings between the materials comprising plates 1 and 2, and 1 and 
2' respectively, are different.] 

At very small separations, t>e negligible (and independent of u) 

to first approximation) , but it is still easy to make rcravity vanish anyway. Using 
Eq. (12), we see that 

I^Gravity _ ^Gravity | ^ 2ttGL'^PiDi \p 2 D 2 ~ P2'^2'h (23) 



where pi and Di are the density and plate thickness of the tth plate. Then 



{P2D2 — P2'D2') {M2 — M2') TCravity — Oj (24) 

so if the detector plates 2 and 2' have the same mass, the gravitational torque 
will vanish. 

Of course, the much bigger challenge is choose materials for plates 2 and 2' 
such that the Casimir torque Tcasimir also vanishes. If the plates were perfectly 
conducting, this would be the case since Eq. (13) would be identical for all 
such plates with the same surface area. However, the finite conductivity of real 
metallic plates becomes very important when the plate separation d ^ Xp, where 
Xp = 2 ttcIujp, and top is the plasma frequency of the metal. Still, it was shown 
recently [32] that the Casimir force between pairs of copper and gold plates 
are equal to a good approximation for separations d > 10~® m at T = 0. Such 
calculations are difficult for real materials, but this certainly raises the hope that 
it is possible to choose appropriate plates 2 and 2' such that 

pC—_pC—^Q ^ TCasimir-0. (25) 



At the very least, one should be able to fabricate the plates using two differ- 
ent isotopes of the same element (e.g., ^"^Mg and ^®Mg) such that Eq. (25) is 
satisfied. The underlying premise of the “iso-electronic” effect (IE) is that to a 
good approximation the Casimir effect depends on the electronic properties of 
the materials, and hence is largely independent of their nuclear properties. By 
contrast, the gravitational interaction, and virtually all proposed new Yukawa 
interactions, involve couplings to both electrons and nucleons. Hence, subtract- 
ing out the electronic contributions by choosing two isotopes of some material, or 
by choosing materials with similar electronic properties (such as Cu and Au), we 
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can enhance the signal from a new Yukawa force while simultaneously reducing 
the Casimir background. 

The remaining torque after the gravitational and Casimir torques have been 
suppressed might be due to a new Yukawa force. Using Eq. (15), the net torque 
due to a putative Yukawa force would be 

TYukawa = 2ttGX^ {L I 2) L‘^ p, (l - 

X Q;i2P2 (l - - q;i2'P2' (l - . (26) 

If A ^ Di, then Eq. (26) simplifies to 

TYukawa — 27rGA^(L/2)L^pie“‘^^^ [ai2P2 — <ai2'P2'] • (27) 

If the Yukawa force arises from an extra-dimensional modification of Newtonian 
gravity such that Oi 2 = o;i 2 ' = On, Eq. (27) reduces to 

TYukawa - 2'Ka^GX^ {L I2)L^ ^ 

This result depends on the difference in the mass densities p 2 — P 2 ' for a simple 
reason: When X Di, the force only sees the mass within a distance A of the 
surface. Thus, if plates 2 and 2' have different densities, the effective mass seen 
by the Yukawa force will be different and a net torque arises due to the “finite- 
size” effect discussed earlier. It is then clear that one should choose materials 2 
and 2' such that p 2 and p 2 ' differ as much as possible while still ensuring that 
the Casimir torque vanishes. For the gold/copper and ^^Mg/^®Mg combinations 
suggested above, 

r 0.32 2 = Au, 2' = Cu, 

1 - ^ ~ i (29) 

^2 [ 0.077 2 = 26Mg, 2' = 24Mg. 

The hope is that the suppression factor Eq. (29) is more than compensated by 
the reduction of the unwanted background torques. 

4.2 Null Experiment #2 

We conclude this section by briefly describing another possible design for a short 
distance null experiment motivated by another set of gravity experiments [14,39]. 
As shown schematically in Figure 6, this experiment consists of two disks, one 
serving as the source mass, while the other (detector mass) is the pendant of 
a torsion pendulum. Each disk is divided into alternating wedges made of two 
different materials 1 and 2. The wedges and materials 1 and 2 are designed 
such that no Casimir torque would arise when the source disk rotates below the 
pendulum. For example, as indicated above, 1 and 2 might be gold/copper or 
^^Mg/^®Mg which would significantly reduce the Casimir torque. Then, if the 
separation between the disks is small, the remaining torque on the pendulum 
would arise from a putative Yukawa force because the effective mass within a 
distance of A will be different for alternating wedges. 
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Fig. 6. Schematic design for Very Short Distance Nnll Experiment ^2. See text for 
details. 



5 Discussion 

It is clear that there is significant motivation for testing Newtonian gravity 
at very short distances. However, as we have seen, new problems will face ex- 
perimentalists who wish to extend current constraints on the Yukawa coupling 
constant ol down to ranges A < 10“^ m. Our aim in this paper has been to point 
out some of the most obvious difficulties, but there may be others that have 
passed unnoticed. While we have not addressed the significant issue of improv- 
ing the sensitivity of experiments to very small forces, we have taken the first 
steps towards dealing with the problem of detecting small forces against a large 
intermolecular force background. Our schematic designs for null experiments are 
meant as illustrations of the principles involved in canceling background forces 
(the “iso-electronic” and “finite-size” effects), and hence are not intended to 
suggest optimal designs. It is hoped that experimentalists, who face the realities 
of imperfect materials and incomplete theories, can extract some useful ideas 
from these schematic designs or, perhaps, will be able to point to flaws which 
preclude them from working as actual experiments. Finally, we conclude with 
the encouraging note that since so little is known about gravity at separations 
< 10“^ m, virtually any good experiment in this region will tell us something 



new. 
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Abstract. The main general relativistic effects in the motion of the Moon are briefly 
reviewed. The possibility of detection of the solar gravitomagnetic contributions to the 
mean motions of the lunar node and perigee is discussed. 



1 Introduction 

In a recent paper, Gutz wilier has provided an admirable review of the oldest 
three-body problem, namely, the Sun-Earth-Moon system [1]. Some work on 
the relativistic theory is mentioned in his paper; however, in view of the recent 
advances in relativistic celestial mechanics this subject deserves a more complete 
discussion. Here we provide a brief description of the main relativistic effects. 

The lunar laser ranging experiment has opened up the possibility of measur- 
ing relativistic effects in the motion of the Moon; indeed, the agreement between 
the standard general relativistic model that contains over a hundred model pa- 
rameters and the ranging data accumulated over the past three decades is ex- 
cellent [2,3]. For instance, the post-fit residuals in the Earth-Moon distance are 
at the centimeter level [2,3]. Simple theoretical estimates lead to the conclusion 
that the main relativistic effects in the lunar theory are due to the spin-orbit 
coupling of the Earth-Moon system in the gravitational field of the Sun. The 
post-Newtonian influence of the solar field on the lunar motion consists of terms 
that can be classified as either harmonic (i.e. periodic) or secular (i.e. cumula- 
tive) in time. It turns out to be very difficult in practice to separate the harmonic 
terms from the corresponding Newtonian terms with the same periodicities. In 
effect, the existence of the post-Newtonian harmonic terms leads to small rel- 
ativistic corrections in the numerical values of certain model parameters that 
are thereby adjusted by a fit to the ranging data. To give an example of such 
harmonic effects, we mention our prediction of a 6 cm relativistic tidal variation 
in the Earth-Moon distance with a period of 1 /2 synodic month [4] . 

The main secular terms turn out to be essentially due to the precessional 
motion of the Earth-Moon orbital angular momentum in the field of the Sun. 
The Earth-Moon system can be thought of as an extended gyroscope in orbit 
about the Sun; we are interested in the description of the motion of the spin 
axis of this gyroscope with respect to the “fixed” stars (i.e. the sidereal frame). 
An ideal pointlike test gyroscope carried along a geodesic orbit would exhibit, 
in the post- Newtonian approximation, geodetic precession due to the orbital 
motion around the mass of the source as well as gravitomagnetic precession due 
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to the intrinsic rotation of the source; however, the finite size of the gyroscope 
in this case (i.e. the orbit of the Moon about the Earth) leads to additional tidal 
effects. The main post-Newtonian gravitoelectric effect, i.e. geodetic precession, 
results in the advance of the Moon’s node and perigee by about 2 arcseconds 
per century as first predicted by de Sitter already in 1916 [5]. This motion has 
been measured by Shapiro et al. with an accuracy of about one percent [6]. It 
is a relativistic three-body effect; therefore, we consider in the next section the 
restricted three-body problem in general relativity and briefly indicate, in par- 
ticular, the more subtle post-Newtonian gravitomagnetic contributions to the 
motions of the Moon’s node and perigee that are caused by the rotation of the 
Sun; indeed, solar rotation induces cumulative relativistic tidal effects in the 
Earth-Moon system [4]. 



2 Restricted Three-Body Problem in General Relativity 

In our previous work [4] , we developed a new scheme for the approximate treat- 
ment of the restricted three-body problem in general relativity. This coordinate- 
invariant approach is particularly useful for a reliable theoretical description of 
relativistic (solar) tidal effects in the motion of the Earth-Moon system. We as- 
sume that the Moon follows a geodesic in the gravitational field generated by the 
Earth and the Sun. This field may be calculated as follows: we first imagine that 
the Earth follows a geodesic in the solar field. Along this geodesic, we set up a 
geocentric Fermi coordinate system. This system, which involves the tidal field 
of the Sun, is then enhanced by taking due account of the field of the Earth in 
the linear approximation. Tidal effects in general relativity involve the projection 
of the Riemann tensor onto the tetrad frame of the measuring device. Consider 
the tidal matrix for a test system ( “Earth” ) in free fall in the gravitational field 
of a rotating mass (“Sun”). In the standard first-order post-Newtonian treat- 
ment, the spatial axes of the local tetrad frame along the orbit are obtained 
by boosting the background Minkowski axes and adjusting scales to maintain 
orthonormality; the resulting tidal matrix for an approximately circular geodesic 
orbit turns out to be sinusoidal in time [7]. In this case, the tetrad frame is not 
parallel-transported, but its motion involves the Lense-Thirring orbital preces- 
sion as well as the geodetic (i.e. de Sitter-Fokker) precession of the spatial axes. 
Once the parallel transport of the spatial axes along the orbit is imposed, the 
gravitomagnetic (i.e. Schiff) precession of the spatial axes would also appear in 
the first post-Newtonian order. In this order, the tidal matrix for the parallel- 
transported axes contains a secular term as well that must therefore be a direct 
consequence of the Schiff precession of the spatial axes [8], in agreement with 
our previous work [9-11]. The linear growth of this gravitomagnetic contribution 
to the tidal field poses a problem for the first post-Newtonian approximation: 
the non-Newtonian “off-diagonal” part of the tidal matrix can diverge in time 
[9-11]. To avoid this limitation, we have developed a post-Schwarzschild treat- 
ment of gravitomagnetic tidal effects; indeed, the concept of relativistic nutation 
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provides a natural resolution of this difficulty by limiting the temporal extent of 
validity of the post-Newtonian approximation [9-11]. 

Imagine, for instance, a set of three orthogonal test gyroscopes falling freely 
along an inclined circular geodesic orbit with constant radius r (“astronomical 
unit”) about a slowly rotating central body (“Sun”) with mass M and proper 
angular momentum J . The motion of the spin axes of these torque-free gyro- 
scopes, which constitute a local inertial frame (i.e. the geocentric Fermi frame), 
is essentially governed by the equations of parallel transport along the geodesic 
orbit. By solving these equations using the post-Schwarzschild approximation 
scheme that takes M into account to all orders, it can be shown that the av- 
erage motion of the gyroscope axes with respect to an effective Newtonian (i.e. 
sidereal) frame consists of a gravitoelectric processional motion — i.e. geodetic 
precession that was first completely analyzed by Fokker — together with a com- 
plex gravitomagnetic motion that can be loosely described as a combination of 
processional movement and a harmonic nodding movement. The latter motion is 
a new relativistic effect of a rotating mass and has been referred to as relativistic 
nutation [11]. In the post-Newtonian approximation, the nutational terms over 
a limited time combine with the other gravitomagnetic processional terms to 
give the Schiff precession. To see how this comes about, let us denote by r the 
proper time of the geodesic orbit and consider a vector normal to the orbital 
plane (ecliptic) at the beginning of measurement (r = 0). Relativisticnutation 
is a periodic variation of the angle between this vector and a gyroscope axis 
that is Fermi propagated along the orbit. The leading contribution of relativistic 
nutation to this angle can be written as 

6>n « C [sin(r7o + wft) - sin 770 ] sin a, (1) 

where rjQ is the azimuthal position of the Earth in the ecliptic at t = 0 measured 
from the line of the ascending nodes and ^ = J/Mr'^uj. Here co,uj'^ = GM/r^, 
approximately describes the orbital frequency in the absence of rotation and 
a denotes the inclination of the orbit with respect to the equatorial plane of 
the Sun [12]. The frequency of this nutational oscillation is the Fokker frequency 
Lop « |ew, where e = GM/c^r. The nutation amplitude, ^ sin a, does not depend 
on the speed of light c. This remarkable fact can be traced back to the occurrence 
of a small divisor [9-11] involving the Fokker frequency. In the post-Newtonian 
limit of the post-Schwarzschild approximation, Eq. (1) reduces to ~ 
which represents a precessional motion with frequency uj„ = sin a cos 770 
about a direction opposite to that of orbital velocity at r = 0. Thus, relativistic 
nutation reduces to a part of the Schiff precession in the first post-Newtonian ap- 
proximation. It follows from this analysis that the first post-Newtonian approx- 
imation breaks down over timescales of the order of Fokker period tf = 2t: jojp] 
however, this fact does not diminish the usefulness of the first post-Newtonian 
approximation for the description of observations in the solar system since in 
this case the Fokker period is almost immeasurably long (e.g. Tp ~ 67 million 
years for the motion of the Earth about the Sun). 

Let us consider the influence of the gravitomagnetic field of the central body 
(“Sun”) on the relative acceleration of two nearby test particles (“Earth” and 
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“Moon”) moving along the circular geodesic orbit. The dominant contributions of 
the gravitomagnetic field of the central body to the tidal matrix, first calculated 
by the authors [9-11], are proportional to 

w^^sinasin , (2) 



which is directly proportional to the amplitude of relativistic nutation sin a) 
and exhibits a maximum (at r = Tpf2) that is independent of the speed of light 
c. It follows from Eq. (2) that to first order in ujft <C 1, the dominant grav- 
itomagnetic amplitude varies linearly with r. This secular amplitude originates 
from a coupling of the nutation part of Schiff precession with the amplitude 
of the Newtonian contribution to the gravity gradient [13]. It should be 
mentioned in passing that the relativistic quadrupole contributions to the tidal 
matrix have properties quite similar to the gravitomagnetic tidal effect described 
here [10]. 

Let us now turn to the potentially observable effects of the solar gravitomag- 
netic field on the lunar motion. The lunar path is determined by the Newton- 



Jacobi equation 



(fx^ Gm , 



-Kij{r)x^ , 



(3) 



where = 1,2,3, represent the geocentric Fermi coordinates of the Moon, m 
is the total mass of the Earth-Moon system and R{t) denotes the Earth-Moon 
distance depending on the proper time r measured along the geocentric path 
around the Sun. Here K is the tidal matrix. Equation (3) describes the motion 
of the Moon with respect to a geocentric local inertial frame [14]. Using the 
equation of relative motion (3), we have calculated — among other things — the 
influence of the tidal field of the Sun on the orbital angular momentum of the 
Moon with respect to the Earth. To express the result with respect to the sidereal 
frame, we choose as our sidereal reference framethe geocentric Fermi frame at 
T = 0. This frame is related to the Fermi frame at time r by a rotation matrix 
that incorporates the relativistic precession and nutation of the Fermi frame 
with respect to the sidereal frame. In the first post-Newtonian approximation, 
this motion reduces to a (Fokker plus Schiff) precession. Let D denote this 
rotation matrix, then Ci = DijLj, where the sidereal components (Ci) of the 
orbital angular momentum are obtained from a transformation of the geocentric 
components (Lj) with 

Dij = 6ij - Cijk^k , ^ / ujFs{r')dr'. (4) 

^0 



Here the analysis is limited to the first post-Newtonian approximation and lofs 
represents the frequency of (Fokker plus Schiff) precession. The direction of Schiff 
precession is not fixed along the Earth’s orbit; therefore, ^ contains (cumulative) 
secular terms (which represent simple precession) together with (harmonic) nu- 
tational terms of frequency 2w and amplitude of order ae^. Averaging over the 
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latter terms, the dominant secular terms in <P are given by 



~ -ae^wT sin ? 7 o , <p 2 




euT 



^3 



1 

2 



ae^coT cos t]q 



( 5 ) 



with respect to the geocentric Fermi frame at r = 0, which has its first axis 
essentially along the radial position of the Earth, the third axis approximately 
along the direction of motion of the Earth and the second axis normal to the 
ecliptic (in a direction opposite to the Earth’s orbital angular momentum about 
the Sun). We note that for C = Oj Eq. (5) expresses the de Sitter-Fokker effect 
that has been observed by Shapiro et al. [6]. To illustrate our approach, let us 
use Eq. (3) to determine the value of L, which is the angular momentum of 
the Moon in a circular orbit about the Earth with respect to the Fermi frame, 
averaged over orbital motions of the Earth about the Sun (with frequency w) 
and the Moon about the Earth (with frequency fi). Then 

dm . , 

— — — w X Lq , (6) 

dr 

where Lq is the unperturbed orbital angular momentum with respect to the 
geocentric Fermi frame and oj is given by 

WT cos 770) , ( 7 ) 



~ ( 2 sin ^ 



« wo(l - 6e^), 



(8) 



« —ujQae^ ^2 cos 770 ~ ( 9 ) 

to first order in the tidal perturbation characterized by the Newtonian regres- 
sion frequency wq = 3w^/4l7, which corresponds to a period of nearly 18 years 
[15,16]. It is clear from Eqs. (4)-(9) that the motion of (£) can be expressed 
as a Newtonian regression modulated by long-term (secular) relativistic pertur- 
bations characterized by the de Sitter-Fokker, Schiff and gravitomagnetic tidal 
effects. To illustrate this point, let us assume for the sake of simplicity that in 
the absence of relativistic effects the lunar orbital angular momentum undergoes 
a steady regression of frequency cug and that once relativistic effects are included 
the average motion in the Fermi frame is one of precession with the frequency 
given by Eqs. (7)-(9). It then follows that the expression for (£ 2 ), he. the aver- 
age of the second sidereal component of the lunar orbital angular momentum, 
contains a dominant gravitomagnetic contribution of the form 

(£ 2 ) secular « 2a/3eC(M7?o^)wTsin(T7o -k Co - wqt), (10) 

where (3, /i, i?o and Co denote, respectively, the inclination of the lunar orbit with 
respect to the ecliptic (« 5°), the mass of the Moon, the mean Earth-Moon sepa- 
ration and the longitude of the ascending node of the orbit of the Moon measured 
from the first axis of the sidereal frame. These simple considerations that are 
based on an initial circular orbit only indicate the nature of the secular terms 
involved; clearly, extensive calculations are necessary for a complete treatment. 
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3 Discussion 

The results of our theoretical work are of particular interest for the description 
of dominant relativistic gravitational effects in the motion of the Moon, espe- 
cially the gravitomagnetic tidal component of the motion of the orbital angular 
momentum of the Moon. It is important to point out that the eccentricities of 
the orbit of the Earth around the Sun and the orbit of the Moon about the Earth 
should be taken into account; we have ignored them in our preliminary analysis 
[16]. As lunar laser ranging data further accumulate, it may become possible in 
the future to deduce the angular momentum of the Sun from the measurement of 
the solar gravitomagnetic contributions to the mean motions of the lunar node 
and perigee. 

It is interesting to compare our secular gravitomagnetic tidal terms with hy- 
pothetical terms that might indicate a temporal variation of the gravitational 
“constant” G. Our results have thus far been based on a secular term propor- 
tional to r in the tidal matrix K in Eq. (3); however, as can be seen from the 
middle term in Eq. (3), similar effects could be produced if such a term appears 
in G instead. We have shown that our predictions are similar to a variation of G 
in Eq. (3) at the level of 10“^® yr~^; moreover, there are significant differences 
between the two effects that can be used to separate them [4,9]. The present up- 
per limit on |G/G| is at the level of 10“^^ yr~^; therefore, it may be a long while 
before the gravitomagnetic effects in the motion of the Moon become detectable. 
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Abstract. 30 years of lunar laser ranging (LLR) data has been modeled and fit with 
several millimeters precision using the general relativistic equations of motion for solar 
system dynamics. This produces several key tests of that tensor theory of gravity and 
strongly constrains presence of any supplementary interactions. Earth and Moon fall 
toward the Sun at rates equal to a couple parts in 10^®, confirming both the universal 
coupling of gravity to matter’s stress-energy tensor, and gravity’s specific non-linear 
coupling to itself. The expected deSitter precession (with respect to the distant ‘fixed’ 
stars) of the local inertial frame moving with the Earth-Moon system is confirmed to 
3.5 parts in 10® precision, and Newton’s constant indeed shows no cosmological time 
variation at the few parts in 10®^ per year level. All the types of post-Newtonian terms 
in the A-body equation of motion — motional, gravito-magnetic, non-linear, inductive, 
etc. — contribute to the measured details of the lunar orbit, so LLR achieves ’near- 
completeness’ as a gravity experiment and probe. The precision of these measurements, 
especially those connected with lunar orbit frequencies and their rates of change, should 
further improve as LLR observations continue into the future. 



1 Introduction 



In the late 1960s I was seeking new ways to test general relativity theory using 
the new possibilities of space-based experimentts. This led me to calculate the 
gravitational to inertial mass ratio for celestial bodies which contained appre- 
ciable internal gravitational binding energy. Most alternative theories of metric 
gravity yielded an ‘anomalous’ result — that this ratio for a body depends on 
its fractional gravitational binding energy [ 1 ] 



M{G) ^ ^ G 

M{I) ~ 2Mc2 



p{x)p{y) 
\x - y\ 



d^x d^y 



( 1 ) 



resulting in a body-dependent acceleration rate in external gravitational fields 



di — 




(2) 



Here 77 is a theory-dependent dimensionless coefficient sensitive to just about 
every post-Newtonian feature of theory and which vanishes in general relativity. 

Shortly after this theoretical work I learned of the forthcoming lunar laser 
ranging (LLR) capability which would result from the Apollo landings on the 
Moon. LLR would be sufficiently sensitive to measure the difference between 
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the acceleration rates of Earth and Moon toward the Sun to a scientifically 
interesting precision; there was a fortunate near-resonance amplification of a 
synodic month perturbation in the lunar orbit in proportion to any acceleration 
difference 






1 + 2 - 



- 17 









M 



5su 



10 rj cos(u! — S7)t 



uj^ — (oj — 17)2 
meter 



cos(w — I7)t 



(3) 

(4) 



in which oj and w — 17 are the lunar orbit’s sidereal and synodic frequencies, 
respectively [2]. 

The passive laser refiectors placed on the Moon by Apollo astronauts were 
developed in Robert Dicke’s research group at Princeton University. One of their 
mission goals was to measure the changes in the lunar orbit which would result 
from any evolution of Newton’s G in proportion to the Hubble expansion of the 
universe 



1 dG _in 

— — ~ i7 ~ 10 per year. 

Ct dt 

A third LLR measurement of general and historical interest in gravitation 
eventually emerged from the data — ‘geodetic precession’ of the lunar orbit. 
At the dawn of the general relativistic age, within months of Einstein’s publi- 
cation of his gravitational theory in 1916, Wilhelm deSitter applied the theory 
to the calculation of post-Newtonian corrections in lunar motion [3]. The most 
promising outcome of his analysis was a relativistic contribution to the secular 
precession rate of lunar perigee due to the non-radial motion of the Earth-Moon 
system through the Sun’s gravitational field 

3 GMsun I u .,^1 ff.-, 

^ds= 2^+^ l^x ^1- (5) 

Angular astronomical observations never became precise enough to see this rela- 
tivistic precession, but finally in the late 1980s the accumulated LLR data (since 
1969) permitted its measurement [6,5], and with ever-increasing precision in 
subsequent years [7,13]. This precession, which can simply be thought of as the 
tensor field analog of the electromagnetic field’s spin-orbit force, has also ac- 
quired the space-time geometrical interpretation of being the precession of the 
local inertial frame which at each instant accompanies the Earth-Moon system 
through the Sun’s gravity. 

Today, 30 years of LLR data allows not only the three measurements de- 
scribed above, but also a comprehensive collection of other measurements which 
map out the long range, post-Newtonian interaction between bodies in all its 
details. The ranging continues, and the measurement precision should further 
improve into the future. 
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2 Dynamical Equations for Bodies, Light, and Clocks 

General relativity theory makes a number of predictions about details of the 
Moon’s orbit which differ from those calculated from either Newtonian gravity 
or from alternative relativistic theories. That theoretical situation, coupled with 
an accumulated thirty years of laser ranging measurements to the Moon now 
reaching sub-centimeter levels of precision, and some fortuitous amplifications 
of the theory-differentiating perturbations found in the Sun-Earth-Moon dy- 
namical system [8], results in LLR today providing several of the most precise 
measurements supporting the hypothesis that general relativity’s tensor gravity is 
a complete description of the interaction among bodies at macroscopic ranges. 

The most scientifically interesting features of post-Newtonian gravity which 
are measured with high precision in LLR are a consequence of the fully general 
fV-body equations of motion. Analysis groups implement these equations to 
fit the LLR data in the solar system’s barycentric frame. The Sun-Earth-Moon 
system dynamics is symbolically illustrated in Figure 1, with the rest of the solar 
system bodies sufficiently considered at the Newtonian level of perturbation. 
The Earth moves with velocity V and acceleration A with respect to the Sun, 
while the Moon is moving at velocity V + u and acceleration A + a, and for 
purposes of discussing preferred frame effects if gravity is not locally Lorentz- 
invariant [4], the Sun (solar system) moves with cosmic velocity W. The post- 
Newtonian forces on Earth and Moon from the Sun, each other, and themselves 
are dependent on these general motions, and there are non-linear gravitational 
forces for which each mass element of the Earth and Moon experiences forces 
due to the interactive effect of the Sun’s gravity with the other mass elements 
of the same body, or of the other body. 

The fV-body equation of motion in metric gravity has been formulated in the 
literature for the completely general case [10]. Preferred frame effects (i.e., ab- 
sence of local Lorentz-invariance in the dynamical equations) then generally will 
occur in solar system phenomena due to the high speed with which our system 
moves through the cosmos [4,12]. LLR data has been tested for such effects, and 
none are found to a part in 10^ of their expected magnitude, so I will specialize 
consideration to locally Lorentz-invariant gravitational physics. Similarly, the 
conservation laws for total energy, momentum, and angular momentum, which 
we usually expect to hold in the physics of an isolated N-body system, are also 
not assumed in the general case. But analysis of the LLR data shows no evidence 
for any conservation law violations, so a-posteriori, I specialize to consideration 
of fully conservative, Lagrangian-based equations of motion. 

Decades ago Eddington generalized the discussion of the gravitational physics 
in the static, spherically symmetric field of the Sun, in order to manifest some of 
the novel experimental possibilities from alternative gravitational theories. He 
concluded that two free parameters 7 and (3 were sufficient to encompass the 
plausible variations in theory. It is remarkable that when the empirically con- 
firmed foundations of local Lorentz-invariance and energy-momentum-angular 
momentum conservation laws are accepted, then only the two Eddington pa- 
rameters 7 and [3 are needed to parameterize the A^-body equations of motion 
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Velocities and Accelerations of Sun, Earth, and Moon 




Fig. 1. When formulating the Earth-Moon dynamics in the solar system barycentric 
frame, there are post-Newtonian force terms acting between Sun, Earth, and Moon 
which depend on either the velocity or acceleration vectors of both the Earth and Moon. 
Body self- accelerations also result from the inductive inertial forces acting between the 
mutually accelerating mass elements {i, j) within each of these bodies. The intrinsic 
non-linearity of gravity also produces net external forces on these bodies proportional 
to not only the presence of other bodies, but also to their internal gravitational binding 
energies. The motional, accelerative, and non-linear contributions to the three body 
system’s dynamics, taken collectively, make LLR a comprehensive probe of the post- 
Newtonian dynamics of metric gravity in the general case. If the dynamics is not locally 
Lorentz invariant, then the velocity W of the solar system through the cosmos leads to 
novel forces and resulting observable effe cts in LLR proportional to W (or its square); 
but such effects have not been seen. 

for the general dynamical situation. (It is possible that a very small part of 
the gravitational interaction could be of a finite range Yukawa nature, rather 
than inverse square; this possibility is separately tested with the LLR data and 
discussed later.) 

For N bodies in general motion and configuration, and valid for a broad class 
of empirically plausible metric theories of gravity, the order 1/c^ equations of 
motion for these bodies have the form 
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see page 7 of this Volume for an explanation of the symbols. The body grav- 
itational mass strengths Hi = GMi are indicated along with the Newtonian 
acceleration vectors 

= (7) 



ij 



Some descriptions of the several lines of this total equation of motion are worth- 
while. 



Line A: Whenever the metric theory Eddington parameters 7 and /3 differ from 
their general relativistic values 7 gr = Pgr = 1? theoretical consistency re- 
quires that the gravitational to inertial mass ratio of celestial bodies depend 
on the bodies’ gravitational self-energy content 



M{G) 

~m) 



1 - (4/3 - 3 - 7) 



G 

2Mc2 



p{x)p{y) 
\x - y\ 



dPx dPy + 0{l/c^) 



(8) 



and Newton’s coupling parameter G will generally vary in time in proportion 
to the Hubble expansion rate of the universe 

1 dG 

( 9 ) 

In LLR, it turns out that the most precise way to measure any deviation of P 
from its general relativistic value is through measurement of the M{G) /M (!) 
ratio of Earth as given by Eq.(l). 

Line B: Gravity couples to itself, thereby producing non-linear gravitational 
forces between bodies. 

Line C: Just as pairs of moving charges generate magnetic forces between them- 
selves in proportion to the velocities of both charges, pairs of moving masses 
generate gravito -magnetic forces between themselves. 

Line D-. Masses in motion both produce and couple to gravitational fields dif- 
ferently than masses at rest. 

Line E: Accelerating masses generate inductive gravitational forces on other 
proximite masses. 

Line F: The inertia of a mass is altered by its motion and by its proximity to 
other masses. 
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LLR involves precisely measuring the round trip time of propagation of light 
between two separate trajectories, and using a clock moving on one of those tra- 
jectories. So in the solar system barycentric and isotropic coordinates employed 
to express the body equations of motion, there are also model requirements for 
the post-Newtonian modifications to the light coordinate speed function 

c(r-) = Coo (1 - (1 + 7) C^(^)/c^) (10) 

and for clock rates 



dr = dt (1 — U{r)/c^) , 



( 11 ) 



in which U{r) is the total Newtonian gravity potential function due to solar 
system bodies 



C/(r) = ^ 

3 



\r — r'\ 



dV. 



( 12 ) 



It is necessary to use these equations in the modeling of the LLR data in or- 
der to achieve parameter fits consistent with those obtained from geocentrically 
modeled satellite laser ranging measurements, and to eliminate relatively small 
spurious range effects due to motion of the Earth and Moon into and out of 
the Sun’s gravitational potential; otherwise these light and clock equations play 
only a supportive role to the body equations of motion in LLR. These equa- 
tions for clock rates and light speeds also establish the relationships between 
the convenient and global solar system barycentric coordinates for space and 
time locations, and the so-called proper local coordinates measured by rulers 
and clocks. 



3 New Long Range Force? 



An additional long-range interaction in physical law would generate a force 
between bodies i and j which is likely to have the static limit structure 

/, = iL, V, (13) 

Tij 



The coupling strengths Ki and Kj, except in special cases, will be body at- 
tributes different than total mass-energy {non-metric coupling), and the depen- 
dence on distance of this force can either be inverse square or Yukawa-like if the 
underlying field transmitting this force between bodies is not massless. Such a 
new force will typically produce a difference in the Sun’s acceleration of Earth 
and Moon, because these two bodies are of different compositions — The Earth 
has a substantial iron core while the Moon is composed of silicate mantle-like 
materials. This Equivalence Principle violating acceleration difference amounts 
to 
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and it will supplement, even in the /x — >■ 0 limit, any difference of accelera- 
tions resulting from the previously discussed gravitational to inertial mass ratio 
anomalies of the bodies. LLR has become a sufficiently precise tool for measur- 
ing |(5aem|j it now competes favorably with ground-based measurements of the 
universality of free fall. 



4 LLR’s Science— Related Range Signals 

Assocated with each feature of gravitational theory which is tested by LLR, 
there are specific range signals in the LLR data whose measurements yield the 
information about theory. Several of these signals are here described. 



Violation of Universality of Free Fall 

If Earth and Moon fall toward the Sun at different rates due to either of the 
mechanisms given by Eqs.(8) or (14), then the lunar orbit is polarized along the 
solar direction. Detailed calculation of this polarization reveals an interesting 
interactive feedback mechanism acting between this cos D polarization signal 
and the cos(2Z?) Newtonian solar tide perturbation of the lunar orbit called the 
variation) . The result is a range signal enhanced from the simpled estimate given 
in Eq.(3) 



3 i? 

Sr{t)yne = 7 — A F{[2/uj) cos D , 


(15) 


2 u) 


= 2.9 X 10^^ (5me cosD cm. 


(16) 



with Jem = |(®e ~ Qm)/fl's|) R IS distance to the Sun, Q and w are the side- 
real frequencies of solar and lunar motion, and D is the lunar phase measured 
from new moon. The feedback amplification factor for the lunar orbit is already 
F{f2/(jj) = 1.75; it grows further with larger orbits and approaches an interest- 
ing singularity for an orbit less than twice as large as that of the Moon [8,9]. 
The most recent fits of the LLR data find no anomalies in the cos D amplitude 
to precision of a few millimeters, so from Eq.(16) this constrains S^e to be less 
than about 2 x 10“^^! This provides a constraint on a combination of the two 
Eddington parameters 

[4/3-3-71 < 5 X IQ-'^. (17) 

Computer integration of the complete Eq.(6) for the Sun-Earth-Moon system 
dynamics confirms these analytically estimated polarization sensitivities. 

From LLR’s measured constraint Sem < 2 x 10“^^, the curves on the right 
in Fig. 2, labeled Equivalence Principle are derived and show the constraints on 
the strength (relative to Newtonian gravity) as function of force range of any 
additional metric or non-metric Yukawa interaction which might supplement the 
tensor field interaction of general relativity. These curves continue horizontally to 
infinite range (inverse square force law) in which domain this LLR measurement 
is presently one of the premier probes of the gravitational interaction. 
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The Newtonian solar tidal perturbation of the Moon’s orbit at second order in 
the orbit’s radius also produces a range oscillation at the synodic frequency; and 
it is 110 km in amplitude! But this perturbation, discovered by the generation 
after Newton and called the parallactic inequality, has amplitude proportional to 
(l7/w) jR. The least precisely known factor here is the Moon’s mean orbital 
size, but it need only be known to half a part in 10® (200 cm) in order that 
the parallactic inequality be determined to the sub-millimeter level and thereby 
enabling further perturbations of a few millimeters to have significance. 

Geodetic Precession of the Local Inertial Frame 

If one collects the difference between the Sun’s acceleration of the Moon and of 
the Earth which emerge from lines D and F of Eq.(6) because of the different 
velocities of these bodies, then a particularly interesting part of this acceleration 
difference is proportional to Eu times the Sun’s acceleration, with, as shown 
in Figure 1, V being the velocity of the Earth relative to the Sun, and u the 
velocity of the Moon relative to Earth. These terms form deSitter’s Coriollis-like 
acceleration 



5a^ = 2 17dS X u , 



(18) 



with 






27+1 GMs 

2 c2i?3 



RxV. 



(19) 



The effect of this perturbing acceleration on the orbit is primarily an additional 
rate of perigee precession. This is measured by comparing the Moon’s anomalistic 
frequency A (rate of eccentric motion) with its synodic frequency D (rate of 
monthly phase), and with the latter converted into lunar sidereal frequency to 
(orbital rate) by adding to D the annual rate f2 which is provided by results from 
other solar system experiments. Sidereal minus anomalistic frequency of lunar 
motion includes deSitter’s precessional rate as a supplement to the (adjusted) 
Newtonian tidal contributions to perigee precession. These lunar frequencies are 
measured from range signal perturbations whose size grows linearly in time. The 
Moon’s range from Earth includes several dominant oscillatory contributions 



= fe cos A + Tvar COs(2I?) + Tevc COs(2H — A) + . . . , (20) 



with r and e being orbital radius and eccentricity, Tvar being the amplitude of 
solar tidal perturbation called variation, and r^vc being the amplitude of the 
hybrid evection perturbation due to both the solar tidal force and the eccentric 
motion of the Moon. So the least-squares-fit of the LLR data, which yields best 
estimates for the two key lunar frequencies, will involve the parameter ‘partials’ 

^St 

= —t {re sin A — revc sin(2I? — ^)) , (21) 

dA 



dD 



2t (^var sin(2U) + revc sin(2U — yl)) . 



(22) 
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Additional model details bring measurement of the Moon’s out-of-plane motion 
frequency into the picture as well, though this does not change the key point here. 
Because these rate signals grow in time, measurement precision of the deSitter 
precession grows with total time of the LLR experiment, not only because of the 
growing quantity and quality of the accumulated range measurements, but also 
because of the linear growth in signal sensitivity. A most recent fit of the LLR 
data confirms presence of the geodetic precession with precision of 3.5 x 10“^. The 
continued rapid improvement of this measurement into the future is expected. 

The mean precession rate of the Moon’s orbit is measured to better than 
0.1 mas/year, which is less than a part in 10^^ of the orbital rate. Any new 
Yukawa-like force of the type given by Eq.(13) will produce an additional pre- 
cession 5uj of lunar perigee in amount 



5uj 

UJ 



1 

2 GMeM^ 






(23) 



For such a force with the matched range l//r = 2 x 10^° cm, LLR observations 
limit the strength of that force to be less than 10“^^ that of the Newtonian grav- 
ity force! This constraint is shown as the left dotted curve in Fig. 2 and applies 
whether the new force is either metric or non-metric. The curve’s secondary 
hump is due to the solar tidal force’s participation in shaping the lunar orbit. 
The perigee measurement is an excellent supplement (at finite range) of LLR’s 
other key experimental constraint on any new forces, shown by the right curves 
in this figure, which results from measuring the equality of the Sun’s acceleration 
of Earth and Moon. 



Time Evolution of Gravity’s Coupling Strength G 



Time evolution of Newton’s coupling parameter G results in proportional evo- 
lutions for both the radial size and frequencies of the lunar motion. Similar, but 
not identical in detail, effects result when a torque (indicated by L) acts on the 
orbit 



r __G 
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djn „ G 
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LOn G 




(24) 

(25) 



During the initial years of the LLR experiment it has been the mean orbital 
radius signal 








r (t- to ) , 



(26) 



which has been used to measure G; this involved estimating and subtracting the 
part which results from the orbital torque exerted on the Moon by the ocean tidal 
bulges on Earth which, because of friction, lag in angle from the direction toward 
the Moon. The inclination and 18.6 year precession of the lunar orbit’s plane 
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LLR’s Strength Constraints On A New Long Range Force 




Yukawa Range - 1/ ^ - km 

Fig. 2. The constraints provided by LLR on the strength of any additional long range 
force, as a fraction of that of gravity, are shown as a function of the additional force’s 
Yukawa range. The dashed curve results from measurement of the Moon’s perigee pre- 
cession rate; the two peaks reflecting contributions from both the Earth’s gravitational 
field and the Sun’s tidal field. The curves on the right (which horizontally continue to 
‘infinite range’ or inverse-square) result from the measured equality of the Sun’s ac- 
celeration of Earth and Moon. These latter Equivalence Principle results are modeled 
two ways — by a supplementary metric force or by a non-metric force. 



result in a modulation of the tidal contribution to f which helps the separation of 
the two perturbations after accumulation of sufficient years of data. But the data 
set produced by LLR has in recent years become sufficiently extended in time so 
that the range signals associated with frequency shifts, which grow quadratically 
in time, are becoming dominant in the fit for G. 



Sr{t)me 




rn UJn{t - to)'^ sin{uJnt - 9n) ■ 



(27) 



This new era also shows enhanced ability to separate the tidal torque L per- 
turbation from any G perturbation, since the ratio of orbital size and frequency 
shifts produced by the two differ somewhat, as seen from Eqs.(24) and (25). 
And those parts of the lunar frequencies which result from the annual motion 
around the Sun, though fully responsive to an evolving G, are not disturbed by 
tidal torque; this also contributes to the separate measurability of G. Recent fits 
of almost 30 years of LLR data yield excellent measurement constraint on this 
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parameter 

^ - (1 ± 2) 10-12 y-i (2 a) (28) 

This is about 1/35 the observed Hubble expansion rate of the universe. The size 
of the perturbation signals associated with G, which are now growing quadratic 
in time, should permit LLR to continue into the future being at the cutting edge 
of precision in supplying measurements of, or constraints on, G. 



5 The Gravitomagnetic Interaction 

Line G of the complete A^-body gravitational equation of motion given by Eq.(6) 
represents a post-Newtonian gravitational force proportional to the velocities 
of both the bodies in the interaction. In analogy with electromagnetic theory, 
part of this interaction has been called gravitomagnetism and represents a force 
between two ‘mass currents’. From line G of Eq.(6), this acceleration is 

^ Qj^ . 

5a, = {2 + 2-i)^^r,jV,-Vj 

^ Qjjl . 

- ■ Vj V,] . (29) 

^ Gj 

It has been alleged that the presence of gravitomagnetism within the total grav- 
itational interaction has not been experimentally confirmed and measured. Dif- 
ferent experiments have been under development to see explicitly the effects of 
this historically interesting prediction of general relativity. But this gravitomag- 
netic acceleration already plays a large role in producing the final shape of the 
lunar orbit, albeit in conjunction with the rest of the total equation of motion, 
so its presence and strength in the equation of motion can hardly be in doubt. 
Because both the Earth and Moon are moving in the solar system barycentric 
frame — the frame in which the dynamical equations are formulated and then 
integrated into orbits — a gravitomagnetic interaction exists between these two 
bodies which have velocities V (t) and V (t) + u{t), respectively, as seen in Fig.l. 
Perturbations to the Earth-Moon distance from the gravitomagnetic accelera- 
tion result proportional to both and Vu, and they are surprisingly large 

7 GM 

5r{t)me — — X ^ r cos(2D) — 5 — ^F{[2/to) R cos D 
3 c^r 

= — 930 cos(2D) -I- 1300 cosD cm. (30) 

As previously discussed, the amplitudes of the lunar motion at both these periods 
(monthly and semi-monthly) are both determined to better than a centimeter 
precision in the total orbital fit to the LLR data. It would be impossible to 
understand this fit of the LLR data without the participation of the gravito- 
magnetic interaction in the underlying model, and with strength very close to 
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that provided by general relativity (ycR = !)• K has been argued that the lu- 
nar orbit, like near-Earth satellite orbits, could be calculated in the geocentric 
frame in which the velocity V essentially vanishes and gravitomagnetic forces 
are negligible; therefore LLR ‘measurement’ of gravitomagnetism is somewhat 
illusory? But this point of view fails to take local Lorentz invariance (LLI) into 
consideration. The very ability to reformulate the equations of motion in the 
geocentric frame without introducing new frame-dependent terms, depends on 
the LLI of the equations. But it is the entire package of velocity-dependent, 
post-Newtonian terms which includes the gravitomagnetic terms, lines C plus 
D of Eq.(6), that produce the LLI; the Eddington parameter 7 represents the 
only freedom remaining in the structure of this LLI package. Our confidence in 
the exhibited structure of this total LLI collection of velocity-dependent terms 
is proportional to the precision with which the various preferred frame effects in 
the solar system proportional to WV, and Wu have not been seen. LLR, 
itself, has contributed to establishment of LLI through its null measurement of 
the third type of effects proportional to Wu [12]. 



6 Inductive (Inertial) Forces 



Inductive forces, shown on line E of Eq.(6) and closely related to gravitomag- 
netism, also play an important role in determining the final lunar orbit. In such 
forces the acceleration of one mass element induces an acceleration of another 
proximite mass element (e.g., i and j in Fig.l). From line E of Eq.(6) we have 

^ Qjjl . 

+‘i)0‘3 + ■ ( 31 ) 

j¥=i 

These accelerations play a key part in adjusting the inertial masses of the Earth 
and Moon because of their internal gravitational binding energies; the absence 
or an anomalous strength of these inductive forces would translate directly into 
differences between the acceleration rates of these bodies toward the Sun, and 
a resultant polarization of the Moon’s orbit in the solar direction. The forces, 
Eq.(31), acting between the mass elements of Earth, for example, by themselves 
lead to an anomalous polarization of the lunar orbit of very large magnitude 

5r(t)-,= (47 -I- 10/3) 13 cos I? meter. (32) 



If the inductive forces are combined with the other post-Newtonian inertial 
forces shown on line F of Eq.(6), only then does the total inertial self force of a 
body become 
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The first line of this self force is simply the inertial force due to the kinetic energy 
and gravitational binding energy within the body. The second line represents 
contributions to the body’s internal virial which, when totaled over all internal 
fields, vanishes for a body in internal equilibrium and experiencing negligible 
external tidal forces. These self forces of a body are actually an integral part of 
the calculation of the total gravitational to inertial mass ratio of bodies, discussed 
previously. They were explicitly discussed here in order to show the large size of 
the inductive force contributions. 

The inductive forces also act externally between Earth and Moon, producing 
a further polarizing perturbation of the lunar orbit. This perturbation, though 
now only of order the Earth’s gravitational potential at lunar distance rather 
than Earth internal potential, is still orders of magnitude larger than the mea- 
surement precision of this polarization achieved in LLR 

Sr{t)e^ = ^ — R cos D = 100 cos I? cm. (34) 

2 c^r uj 

It is actually almost totally canceled by the non-linear force on the Moon propor- 
tional to the product of the masses of Sun and Earth. Like the gravitomagnetic 
interactions, the inductive interactions have been confirmed in LLR to have their 
general relativistic strength to better than a part in a thousand precision. 
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Abstract. For last thirty years, great advances in the testing of relativistic gravity 
have come from interplanetary radio ranging and lunar laser ranging. With optical mis- 
sion concepts in the interplanetary space, testing relativistic gravity can be improved 
by 3 - 6 orders of magnitude and many solar-system parameters can be measured 
either for the first time or more precisely. After reviewing briefly dedicated optical 
mission concepts — SORT, IPLR and ASTROD together with other optical mission or 
mission concepts which have important implications on testing relativity and astrody- 
namics — HIPPARCOS, GAIA and LISA, we concentrate on a specific mission concept 
— ASTROD to discuss various mission goals and capabilities in detail. ASTROD is 
an optical interferometry mission concept. Optical interferometry missions hold great 
promises for the testing of relativistic gravity and for the measuring of solar-system 
parameters. We discuss the determination of relativistic parameters 7 , f3 and the solar 
quadrupole moment parameter J2, the measurements of solar Lense-Thirring effect 
together with the application of laser astrodynamics to solar system studies — solar 
angular momentum, solar g-modes, asteroid masses, etc. 



1 Introduction 

We have seen great advances in the testing of relativistic gravity for last thirty 
years. This is largely due to interplanetary radio ranging and lunar laser ranging. 
Interplanetary radio ranging and tracking provided more stimuli and progresses 
at first. However with improved accuracy of 2 - 3 cm from 20 - 30 cm and long- 
accumulation of observation data, lunar laser ranging reaches similar accuracy 
in determining relativistic parameters as compared to interplanetary radio rang- 
ing. In broad lines, both solar-system radio ranging and lunar laser ranging have 
tested relativistic-gravity effects to 10“^. If laser ranging can be extended to the 
whole solar system, the precision of testing relativistic gravity will be dramati- 
cally improved. If this is by laser pulse ranging, the precision will be improved 
by 3 orders of magnitude as in SORT (Solar Orbit Relativity Test) [1] and IPLR 
(Interplanetary Laser Ranging) [2]. If this is by laser interferometric ranging, 
the precision could be enhanced by another 3 orders of magnitude [3]. That is, 
the improvement from the present basis would be 3 - 6 orders of magnitude. 

The SORT mission concept is to use laser pulses and a drag-free space- 
craft with a precision clock orbiting around the Sun to measure 7 and J2 (solar 
quadrupole moment parameter) precisely [1]. IPLR mission concept also uses 
laser pulses [2]. 



C. Lammerzahl, C.W.F. Francis, and F.W. Hehl (Eds.): LNP 562, pp. 330-343, 2001. 
(c) Springer- Verlag Berlin Heidelberg 2001 
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The ASTROD (Astrodynamical Space Test of Relativity using Optical De- 
vices) mission concept is to use drag-free spacecraft in solar orbits employing 
laser interferometric ranging techniques together with a constellation of Earth 
orbiting satellites to provide high-precision measurement of relativistic effects, 
better determination of the orbits of major asteroids, improvement in the mea- 
surement of 7 , measurement of solar angular momentum via Lense-Thirring 
effect and the detection of low-frequency gravitational waves and solar oscilla- 
tions in a single mission [3-10]. 

SORT, IPLR and ASTROD mission concepts are dedicated relativity and 
astrodynamical mission concepts using optical devices. There are mission con- 
cepts using optical devices which are mainly dedicated to different purposes, but 
also have important implication on testing relativity and astrodynamics. The 
HIPPARCOS (High Precision PARallax Collecting Satellite) mission gives the 
best determination of the PPN parameter 7 using light; the precision is only 
a factor three short of radio observations [11]. The GAIA (Global Astrometric 
Interferometer for Astrophysics) mission concept [12] proposes to measure 7 to 
1 ppm; this accuracy is similar to a dedicated relativity mission. LISA [13] is 
an optical interferometry mission dedicated to the gravitational-wave detection. 
Testing the equivalence principle [14] and microscopic physics in gravity [15] us- 
ing optical methods in space is also promising. Astrodynamical space missions 
using optical interferometry hold great promises for testing relativistic gravity 
and measuring solar-system parameters. In the following, we concentrate on 
the analysis and discussion of such a mission concept — ASTROD to illustrate 
various goals and capabilities in detail. 

The objectives of the ASTROD Mission are threefold. The first objective 
is to discover and explore fundamental physical laws governing matter, space 
and time via testing relativistic gravity with 3-6 orders of magnitude improve- 
ment. Relativistic gravity is an important cornerstone of physics, astronomy and 
cosmology. Its improved test is crucial to cosmology and modern theories of grav- 
itation including superstring theories. Included in this objective is the precise 
determination of the relativistic parameters (3 and 7 , the improved measurement 
of G and a precise determination of an anomalous, constant acceleration directed 
towards the Sun. 

The second objective of the ASTROD mission is the high-precision measure- 
ment of the solar-system parameters. This includes: (i) a measurement of solar 
angular momentum via Lense-Thirring effect and the detection of solar g-mode 
oscillations via their changing gravity field, thus, providing a new eye to see 
inside the Sun; (ii) precise determination of the planetary orbit elements and 
masses; (iii) better determination of the orbits and masses of major asteroids. 
These measurements give better solar dynamics and probe the origin of our solar 
system. 

The third objective is to detect and observe gravitational waves from massive 
black holes and galactic binary stars in the frequency range 50 /rHz to 5 mHz. 
Background gravitational-waves will also be explored. 
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A desirable implementation is to have two spacecraft in separate solar orbit 
carrying a payload of a proof mass, two telescopes, two 1 - 2 W lasers, a clock 
and a drag-free system, together with an Earth reference system. The Earth 
reference system could be ground stations. Earth satellites and/or spacecraft 
near Earth-Sun Lagrange points. A proposal is submitted to ESA in response 
to call for mission proposals for two-flexi-missions F2/F3 with international 
collaboration [4]. 

In 1993, we have proposed to use laser astrodynamics to study relativistic 
gravity and to explore the solar system [5] . With a multi-purpose astrodynam- 
ical mission proposed in 1994 [6,7], we reached the ASTROD mission concept. 
Our ASTROD mission concept is presented in the 1996 COSPAR Assembly 
[7]. ASTROD in its relation to gravitational-wave detection is presented in the 
TAMA Gravitational- Wave Workshop [8]. ASTROD’s sensitivity in measuring 
G is presented in the Pacific Conference on Gravitation and Cosmology [9] . Pre- 
liminary calculation of asteroid perturbations and investigation in the possible 
determination of asteroid masses through range observations of the ASTROD 
mission is published in Planetary and Space Science [10]. 

In Section 2, we discuss the ASTROD payload concept and its technological 
development requirements. In Section 3, we review the orbit simulation with 
uncertainty distributions for the ASTROD to investigate the accuracy for the 
determination of relativistic parameters 7, j3 and solar quadrupole parameter J2, 
together with other standard solar-system parameters. In Section 4, we discuss 
the method to measure the solar angular momentum and low-Z solar oscillations. 
In Section 5, we discuss the strategy and sensitivity of detecting gravitational 
waves. In Section 6 we discuss alternate mission concepts. In Section 7, we present 
an outlook. 

2 ASTROD Payload Concept and Technological 
Development Reqnirements 

For the realization of the ASTROD mission concept, it is desirable to have a 
fleet of drag-free spacecraft in solar orbits together with an Earth reference 
system. The Earth reference system could be ground stations. Earth satellites 
and/or spacecraft near Earth-Sun Lagrange points. Each spacecraft in solar 
orbit communicates and ranges with the Earth reference system. When two of 
them are close to each other, they can communicate and range with each other. 
For the fleet of spacecraft in solar orbits, a two spacecraft implementation is to 
have each spacecraft in separate solar orbit carrying a payload of a proof mass, 
two telescopes, two 1 - 2 W lasers, a clock, and a drag-free system [6,7]. We call 
this the basic ASTROD. 

With the precision requirement of this mission, drag-free is a must. Field 
Emission Electric Propulsion (FEEP) system is under development in Gentro- 
spazio (Italy) and Bradford Engineering (Holland). Accelerometer is under de- 
velopment in ONERA. 
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Many of the developments for LISA [13] can be applied to ASTROD. Laser 
and telescope are two important items. Due to the absolute requirement, AS- 
TROD needs a demanding laser metrology system and the proof mass needs to 
be specially designed. 

In JPL, space flyable Hg+ frequency standards with 5 kg mass estimate are 
under development. The demonstrated frequency stability to below 10“^® of 
existing Hg’*' standards is hoped to be maintained or improved. An absolute 
stabilized laser can also serve as a clock. More thorough study of the payload 
concept needs to be done in the near future. 

There are several technological developments required for this mission: 

(i) Weak light phase locking to 100 fW incoming light, 

(ii) Heterodyne interferometry schemes and data-analysis technique, 

(iii) Light-weight precision space clock and/or absolute-stabilized laser to 10“^®, 

(iv) Accelerometer with noise level 10“^^ - 3x 10 m/sVilz from 50 ^Hz - 5 
mHz and absolute stability of 10“^^ - 10“^® m/s^, 

(v) A laser metrology system to monitor the positions of various parts of the 
spacecraft to facilitate gravitational modelling. 

With 100 fW, there still are 5 x 10® photons/s. This is enough for 100 kHz fre- 
quency tuning. At present. University of Glasgow has achieved 85 pW weak-light 
phase-locking, and National Tsing Hua University has achieved 2 pW weak- 
light phase-locking with 0.2 mW local oscilator. Further improvement is needed 
in this aspect. Heterodyne interferometry schemes has been proposed by JPL 
people [16] for unequal arms. These schemes can be adapted to the ASTROD 
mission. Data-analysis for detecting gravitational waves and solar oscillations 
are demanding and needs to worked out. Laser stabilization is a subject under 
active study in University of Konstanz and University of Tokyo. Accelerometer 
requirement is very severe and needs to be developed in steps. This is in the 
expertise of ONERA. The absolute requirement may need to be implemented 
using absolute laser metrology. 

3 Orbit Simulation 

From the mission orbit design process, we have reached the following orbits for 
the two spacecraft. Two spacecraft are to be launched with one going into an 
inner orbit and the other one going into an outer orbit. After two and half years, 
both will be on the other side of the Sun opposite the Earth and stay near the Sun 
in apparent position with certain time on one side of the Sun and certain time on 
both side of the Sun for a total period of 120 days. Second-order relativistic light 
retardation and solar angular momentum can be measured during this period 
of time. During working time of the two spacecraft, gravitational waves can be 
monitored. 

In order to specifically calculate the orbit, we assume an initial spacecraft 
location in geostationary orbit, one located at the midday longitude and one 
at midnight, and we assume they are launched from these locations to their 
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To Conduct Shapiro Time 
Delay Experiment and to 
Measure Lense-Thirring 
Effect for 1 50 Days 




To Conduct Gravitational Wave 
Experiment, Determination of 
Orbit Elements, and Measuring G 
for the Whole Mission Time 
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• Initial Time = 2005.6.10.00:00:00 


• Inner Orbit Parameters (Perihelion Distance 0.77 AU): 


Position 


-0.190956-0.996874 0.000123 


Velocity 


0.014800 -0.001206 -0.000062 


• Outer Orbit Parameters (Aphelion Distance 1.32 AU): 


Position 


-0.191062-0.997427 0.000123 


Velocity 


0.019116-0.004547 0.000088 





Fig. 1. Left: The inner and outer orbits of two ASTROD spacecraft in the Sun-Earth 
fixed frame; and the triangles formed by the positions of the two spacecraft and Earth 
at 700, 800, and 900 days of mission time. Top right: The apparent angles of the two 
ASTROD spacecraft located respectively on the inner and outer orbits near two and 
half years after launch. Bottom right: apparant angle Y vs. mission time. 



interplanetary trajectories on June 10, 2005. Each spacecraft orbit is propagated 
using a sixth-order Runge-Kutta method with the force term calculated using 
the JPL DE403 Ephemeris data. The computed results are shown in Fig. 1. On 
the left of Fig. 1 the orbits are drawn in the Sun-Earth fixed frame. On the right, 
the apparent angles of the two spacecraft from 800 to 1034 days after launch as 
viewed from the Earth are shown. During this period, two-way ranging between 
each spacecraft and Earth reference system, and between two spacecraft will be 
implemented. More details of the orbit design process including fine tuning to 
have a second close-up around 7.5 years can be found in reference [17]. 

In reference [18], we have implemented an orbit simulation for ASTROD. 
In this section, we review this simulation. First we establish a post-Newtonian 
ephemeris and extend it to a stochastic one. With an appropriate modelling of 
local acceleration noises and instrumental noises, we then determine the accu- 
racy of parameter determination for /3 (nonlinearity parameter), 7 and J 2 (solar 
quadrupole moment parameter). 
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3.1 Post Newtonian Ephemerides and ASTROD Orbits 

The barycentric metric with solar quadrupole moment is given as follows 
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with Ti = X — Xi , r = Xi — Xj , mi = GMi j (? , Mi ’s the masses of the bodies with 
Ml the solar mass [19]. J 2 is the quadrupole moment parameter of the Sun. z is 
the unit vector normal to the elliptic plane. The solar angular momentum has a 
7° tilt with respect to z. This is marginally detectable in our simulation. To be 
simple, we assume that they are in the same direction. 

The equations of motion of A-mass problem associated with this metric can 
be derived from geodetic variational principle and used to build our computer- 
integrated ephemeris (with 7 = /3 = 1 , and J 2 = 2 x 10 “^) for nine-planets, 
the moon and the sun. The positions and velocities at the initial time, 2005.6.10 
0:00, are given in advance by the DE403 ephemeris. The evolution is solved 
by using the 4th-order Runge-Kutta method with the stepsize, h = 0.01 day. 
The stepsize, h = 0.01 day, is chosen such that the computer efficiency and the 
numerical accuracy can both approach the requirement. With this tiny stepsize, 
the numerical discrepancy is within about 10“^^ AU for all positions and 10“^^ 
AU/day for velocities. This ephemeris is then used to calculate the orbits of 
inner and outer spacecraft, and the light travel times and Shapiro time delays. 



3.2 Simulation of ASTROD Ranging Data 

Now, we use a stochastic model to generate simulated ASTROD ranging data. 
The ranging observation Zj, at time instant tk is expressed as an explicit function 
h(u, tfe) of uncertain parameters plus a superposing noise term Vfe as follows, 

Zk = ^(u, tk) + Vk , (2) 

where u=(ui, U2, ...) are uncertain parameters to be estimated, e.g., ui = 7, U2 = 
13, M 3 = J 2 etc. u normally includes masses and initial states of planets. However, 
for simulating the uncertainties of relativistic parameter determination, we only 
need to include a few such parameters. 

In the noise term Vfc, we include two types of errors: 

( 1 ) the uncertainty due to the imprecision of the ranging devices. 
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(2) unknown accelerations due to the imperfections of the spacecraft drag-free 

system. 

We treat the first type of error as white Gaussian random noise superimposed 
into the ranging measurement. The present achievable timing error is 50 ps [20]. 
If there are interruptions of interferometric signals, this timing error would con- 
stitute a limit for error reductions. Thermal expansion errors and all other errors 
due to optical devices are required to be below this level. Hence, conservatively, 
we take 50 ps to be the standard deviation of this type of errors. We assume the 
mean of this error to be zero. If the interferometric signals are continuously mon- 
itored, 1 fs (subwavelength) resolution is there; when corresponding magnitudes 
of other errors can be minimized, great improvement can be envisaged. 

As to the second type of errors, there are unknown accelerations fixed or con- 
fined in directional angles with respect to the spacecraft orientation or spacecraft- 
Sun orientation and unknown accelerations truly random in directions. In this 
paper, we only consider the unknown accelerations truly random in directions. 
For modelling, we assume the distribution of these accelerations is Gaussian in 
magnitude with a = 10“^® m/s^ and change directions every 4 hours (equivalent 
to 10“^^ m/s for / ^ 10“^ Hz). We will consider the others in a separate 

paper. To generate a sequence of simulated observations, the simulation pro- 
gram numerically integrates the equations of motion adopted for the stochastic 
model. The equations of motion are unchanged for all solar objects except the 
spacecraft. For the spacecraft, the equations of motion are modified by adding 
the noisy accelerations. After the integration and the total light travel time is 
solved, the first type of noise (50 ps-half-width Gaussian) is added to obtain the 
ranging simulation. The deviations of two simulated rangings from the fiducial 
ranging for the inner spacecraft are shown in Fig. 2. 

3.3 Estimation of Parameters 

We use sequential Kalman filter data processing [21] to obtain the estimated 
uncertainty of the relativistic parameters to be determined from the mission as 





Fig. 2. Deviations of two simulated rangings from the fiducial ranging for the inner 
spacecraft. The smooth curves are the ± standard deviation of simulations vs. epoch. 
For 800-1050 days of the mission, the time density is 10 times higher. 
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Fig. 3. o^, a/3, aj-2 as functions of mission epoch. 



a function of epoch. The relativistic parameters /3 and 7 , the solar quadrupole 
moment parameter J 2 together with initial state of the spacecraft and mass 
parameters of Sun, Mercury, Venus, Earth system. Mars and Jupiter are esti- 
mated. The uncertainties of /3, 7 and J 2 versus mission epoch estimated for inner 
spacecraft are shown in Fig. 3. With the 50 ps ranging uncertainty, 7 can be de- 
termined to 7.7 X 10“^. When ranging uncertainty is decreased, better resolution 
of 7 in the range 10“^ — 10“® is possible. In addition, (3 and J 2 can be determined 
to 10“® and 4.5x10“® respectively, with 50 ps ranging uncertainty and 10“^® 
m/s^ (time scale: 4 hours) accelerometer random noise uncertainty. 

Extension of simulation [22] to the determination of the masses of three big 
asteroids (Ceres, Vesta and Pallas) [10], the rate of change of the gravitational 
constant, and an anomalous constant acceleration directed towards the Sun [23] 
has been completed with good results. 

4 Solar Angular Momentum and Solar Oscillations 

To measure the solar Lense-Thirring effect, we need the difference, t\ — t 2 , in 
the two round trip propagations, ERS (Earth Reference System; Earth-system 
basis) — >■ S/C 1 (Spacecraft 1) — >■ S/C 2 (Spacecraft 2) — >• ERS and ERS — >■ S/C 
2 — >■ S/C 1 — >■ ERS. Newtonian calculation of ti — t 2 for 800 - 1034 days after 
launching gives about 10 ms. The Lense-Thirring effect variation for this period 
of time is about 100 ps and has a totally different signiture. For a laser stability 
of 10 “^®- 10 “^®, the accuracy for measuring the round-trip time difference ti —t 2 
(about 10 ms) is 10 as - 1 fs and the sensitivity of measurement of the Lense- 
Thirring effect is 10“^ - 10“® by means of this interferometric measurement. 
From a preliminary theoretical modelling, we advocate that the Lense-Thirring 
effect can be determined to 0 . 1 % or better. 

Since the solar Lense-Thirring effect is proportional to the solar angular 
momentum, this measurement of the solar Lense-Thirring effect will give an 
accurate value of the solar angular momentum. At present, there is about 10% 
uncertainty [24] in the solar angular momentum in various solar models [25-28] . 

The strain sensitivity of ASTROD spacecraft will reach 10“^° - 10 “^^/ -\/Hz in 
the 50 /xHz - 5 mHz frequency range. The low I p-modes solar oscillations will be 
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readily detectable through their changing gravity field if there is no background 
“noise” masks these signals. The g-mode sensitivity of ASTROD is about 2 
orders of magnitude better than LISA and SOHO. ASTROD will have a good 
chance to detect and observe these g-mode oscillations. This will create a new eye 
to see inside the Sun. However, all will depend on whether the background noise 
can be resolved. Hils, Bender and Webbink [29] estimated that there are about 
3x10® short-period white-dwarf binaries producing a large gravitational wave 
background in the relevant frequency band. The recent discovery of two white- 
dwarf binaries [30,31] (periods: 3.47 h and 4 h) give input for a better estimate of 
the population. We are analyzing in the following how this background noise can 
be lifted for the gravitational-wave detection of the solar g-mode, p-model and f- 
mode oscillations. From more precise orbit determinations, the solar quadrupole 
moment and higher moments can be measured. This together with solar angular 
momentum measurement and solar oscillation measurement constitute a gravity 
eye to see inside the Sun. 

Efforts to see inside the Sun have stimulated important works in solar neu- 
trino experiments and helioseismological observations. Recently, Schutz [32], 
Gough [33], and Cutler and Lindblom [34] suggested the possibility that solar 
oscillations might be observable by measuring their associated gravitational per- 
turbations. With a detailed analysis. Cutler and Lindblom [34] concluded that 
LISA may be confusion limited at the relevant frequencies due to the galactic 
background from short-period white dwarf binaries and present estimates of the 
number of these binaries would require the solar modes to have energies above 
about 10®® erg to be observable by LISA. 

ASTROD has better sensitivity in the frequency-band considered. In the first 
TAMA Workshop on Gravitational Wave Detection, we reported that the confu- 
sion limit might be lifted for the gravitational detection of the solar oscillations 
for ASTROD due to higher strain sensitivity and its orbit configuration [8]. In 
the third Amaldi Conference on Gravitational Waves, we present a scheme for 
the sparation of the gravitational-wave signals and the solar oscillation signals 
[35]. We review the scheme in the following. 

The change of external gravitational potential, due to the mode 

oscillation of the sun can be written in the following simple form: 






4 ^ 

2,£ 1 Rq i?Q 







(3) 



The surface helioseismology favors p-mode detection while gravitational he- 
lioseismology favors p-mode detection. Table 1 compiles some important param- 
eters for selected solar oscillations. 

For the ASTROD, the strain sensitivity for one year intergation for Signal- 
to-Noise-Ratio S/N = 5 threshold is 10“^® in the frequency range 100 /xHz - 1 
mHz. With this strain sensitivity, the £ = 2 mode detection threshold is about 
1 mm for This is about 2 orders of magnitude more sensitive than LISA. The 
time constants for solar oscillations are long — over 10® yr for low I g-mode 
oscillations [36] and over 2-3 months for low I p-mode oscillations [37]. The 
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distance to the Sun of the inner ASTROD spacecraft varies from 0.77 AU to 1 
AU and that of the outer ASTROD spacecraft varies from 1 AU to 1.32 AU. 
When the spacecraft move in solar orbits, the amplitude and direction of the 
solar oscillation signals receive deep modulations in addition to the modulations 
due to spacecraft motion and orientation. The time constant for the gravitational 
radiation (or orbit evolution) of the Close White Dwarf Binaries (CWDB) is more 
than 10® yr, and hence the CWDB confusion background is steady in the inertial 
space. This background is modulated only by the orientations and motions of 
spacecraft, not by the distances and orientations of the spacecraft relative to 
the Sun. With this extra modulation — deep in magnitude and direction, the 
detectability of the solar oscillation signals reaches at least 5 orders lower than 
the confusion limit, i.e., to the instrumental noise floor. 



Table 1. Frequencies, energies, and surface velocity amplitudes of various 1=2 modes 
of solar oscillations with 5=lmm [35] 



Mode 


Frequency (/^Hz) 


Energy (erg) 


Surface radial velocity (mm/s) 


g3 


220.4 


2.82 X 10^'^ 


0.183 


g2 


254.0 


3.93 X 10^'^ 


0.308 


g4 


192.2 


4.08 X 10^'^ 


0.138 


Pi 


381.6 


5.13 X 10^'^ 


1.15 


P2 


514.4 


3.56 X 10^® 


8.20 


P3 


663.6 


1.96 X 10^® 


35.5 



5 G— Wave Detection 

For gravitational- wave detection, we compare the following two paths through 
the whole missions except for the period 850 - 1000 days: (i) ERS — >■ S/C 1 
^ ERS ^ S/C 2 ^ ERS; ERS ^ S/C 2 ^ ERS ^ S/C 1 ^ ERS. The 
time difference of the two paths varies up to 0.34 s. A laser linewidth of 1 Hz 
would keep the interference coherent. The sensitivity 10“^®-10 “^^/-\/Hz would 
be comparable to that of LISA, but more sensitive to lower frequency due to 
long path. The round-trip interference for measurement of the Lense-Thirring 
effect also has some sensitivity on the gravitational wave [8]. Various methods 
of Doppler tracking and noise substraction schemes [16] can also be applied here 
for detection of gravitational waves. 

Now we discuss the ASTROD sensitivity for gravitational waves following our 
presentation in the LISA Symposium (ref. [21] in [8]). Let the distance between 
the spacecraft (S/C 1) in the inner orbit and the ERS be ^i, and that for the 
spacecraft (S/C 2) in the outer orbit and the ERS be l2- To make a Michelson 
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S/C 1 



Fig. 4. Gravitational-wave detection topology. Path I: ERS — >■ S/C 1 — >■ ERS — >■ S/C 
2 ERS. Path II: ERS ^ S/C 2 ERS ^ S/C 1 ERS. 

interferometer from these two arms would require a linewidth of 1 mHz. To 
minimize the armlength difference, we propose to compare the following two 
paths through the whole mission except for the period 850 - 1000 days: (I) 
ERS^S/C l^ERS^S/C 2^ERS; (II) ERS^S/C 2^ERS^S/C I^ERS. The 
time difference of the two paths varies only up to 0.34 s. A laser with linewidth of 

1 Hz would keep the interference coherent. The phase sensitivity of this topology 
to gravitational wave is calculated as follows. In the plane containing S/C 1, S/C 

2 and ERS (for simplicity, we use LI as ERS), draw a reference line LI— >-A as in 
Fig. 4. Let the angle between ERS— >-S/C 1 and ERS— >-A be 9i, and that between 
ERS— >-S/C 2 and ERS— >-A be 6 * 2 . First consider a monochromatic gravitational 
wave with + polarization and strain amplitude coming in perpendicular to 
the orbital plane. The change of length scale in the LI— >-A direction at time t is 
h+ sin(27r/Gf). For laser light travelling through Path I and Path II to return at 
t simultaneously, the optical path difference is 

Al = 4/i+(c//g)(cos(20i) + cos(202))x 

[cos(27t/g(ti - T2 )) + cos(27r/G(ri + T2))] cos(27r/(t + (j)o )) , (4) 

with Ti = 2lijc, and T 2 = 21^1 c. Hence phase difference of the laser light is 

A<i> = 4/i+(///g)(cos(20i) + cos(26»2))x 

[cos(27t/g(ti - T2 )) + cos(27r/G(ri + T2))] cos(27r/(t + </)o)) . (5) 

This is the laser phase senstivity for our gravitational detection topology. For 
X polarization, simply rotate by 45°. For non-perpendicular incidence, insert a 
cosine factor. 

The theoretical sensitivity limit (shot noise limit) in the strain for gravitation- 
al-wave detection is inversely proportional to with P the received power 

and I the distance. Since P is inversely proportional to P and P^^^l is constant, 
this sensitivity limit is independent of the distance. For 1 - 2 W emitting power, 
the limit is 10 “^^/-\/Hz. As noted in the LISA study, making the arms longer 
shifts the whole time-integrated sensitivity curve to lower frequencies while leav- 
ing the bottom of the curve at the same level. With the same laser power, the 
ASTROD sensitivity would be shifted to lower frequency by a factor up to 60 
(30 in average) if the technological requirements in Section 2 are met. 
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For one-spacecraft ranging with one leg on earth, 10 ps accuracy in the 
transit time variation is achievable. With 10® - 10® Hz repetition rate and a 
distance of 2 AU, the sensitivity can reach 10 “^^/-\/Hz when an ultrastable clock 
of comparable stability is available in the future. In the T2L2 project study of 
satellite time transfer by laser light, precision and accuracy are aimed at the 
order of 10 ps. For gravitational-wave detection, this implies a sensitivity of 

io-i6_io-i7/Vife. 

6 Mini— ASTROD and Snper— ASTROD 

ASTROD is a relativity mission concept encompassing multipurposes. With its 
technological requirements achieved, it would give a gravitational-wave sensi- 
tivity similar to LISA, but shifted to lower frequencies. ASTROD would com- 
plement LISA in probing the early Universe and study strong-held black hole 
physics. 

A down-scaled version, i.e., Mini-ASTROD with one spacecraft carrying 
a payload of a telescope, two lasers, and a clock will test the optical scheme 
and yet give important scientihc results. These scientihc results include a better 
measurement of 7 to 1 ppm [38], a better sensitivity (1-2 orders better) in using 
optical Doppler tracking method for detecting gravitational wave [39,40], and a 
potential of measuring the solar angular momentum via Lense-Thirring effect 
[41]. It is important to do things in appropriate steps and we are now studying 
this mini-ASTROD mission concept in more details. 

With the advance of laser technology and the development of space interfer- 
ometry, one can envisage 15 W (or more) compact laser power and 2-3 fold 
increase in pointing ability. With these developments, one can increase the dis- 
tance from 2 AU for ASTROD to 10 AU (2x5 AU) and the spacecraft would 
be in orbits similar to Jupiter’s. Four spacecraft would be ideal for a dedicated 
gravitational- wave mission (super-ASTROD). 



7 Outlook 

Optical space missions will be important in testing relativistic gravity and mea- 
suring solar-system parameters. Laser Astrodynamics in the solar system en- 
visages ultraprecision tests of relativistic gravity, provision of a new eye to see 
the solar interior, precise measurement of G, monitoring of the solar-system 
mass loss, and detection of low-frequency gravitational waves to probe the early 
Universe and study strong-held black hole physics together with astrophysics of 
binaries. One spacecraft and multi-spacecraft mission concepts — SORT, Mini- 
ASTROD, ASTROD and Super-ASTROD are in line for more thorough mission 
studies. In view of their importance both in fundamentals and in technology de- 
velopments, mission concepts along this line will be realized and fruitful. 
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Clocks for Length and Time Measurement 
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1 Introduction 

The evolution of various fields of science, technology, trade or legal metrology 
is intimately connected with the ability to relate measurements with each other 
that were performed at different places and different instants of time. For this 
purpose a practical system of units of measurement i.e. the International System 
of Units (SI) has been established by international cooperation [1]. In this SI, the 
metre and the second represent the base units of length and time, respectively. 
From all units these two can be realized with by far the highest accuracy since 
they are based on frequency measurements and most accurate clocks. In contrast 
to clocks based on mechanical properties of macroscopic bodies, e.g., pendulum 
clocks, quartz clocks or pulsars, the frequency reference for a suitable oscillator 
in atomic clocks is mainly determined by the intrinsic properties of suitable ab- 
sorbers like atoms, molecules or ions. These atomic properties are determined 
by fundamental constants resulting from the basic interactions between the ele- 
mentary constituents of matter. Following the generally accepted idea that the 
properties of each atomic absorber of a selected species are the same, identical 
clocks can be set up in any desired number and at any desired place. 

Consequently, the definition of the second is based on an atomic frequency: 
The second is the duration of 9 192 631 770 periods of the radiation corre- 
sponding to the transition between the hyperfine levels of the ground state of the 
caesium- 133 atom [2]. In the SI, this definition is also the basis for the length 
unit which is related to the second by a fundamental constant i.e. the speed of 
light. The metre is the length of the path travelled by light in vacuum during a 
time interval of 1/299 792 458 of a second [3]. This definition can be applied 
directly to measure large distances, the distance to the moon, e.g., as has been 
done with unprecedented accuracy (see contributions of K. Nordtvedt, this vol- 
ume on p.317). The large value of the speed of light, however, prevents one from 
measuring distances on a laboratory scale with the required accuracy. The def- 
inition corresponds to a fixed speed of light of c = 299 792 458 m/s and at the 
same time fixes the ratio between the vacuum wavelength A and the frequency 
n = cfX of a, light wave. Hence, the vacuum wavelength of a light wave used for 
accurate interferometric distance measurements is known, in principle, with the 
same accuracy as the frequency of the light wave. Consequently, time measure- 
ments as well as distance measurements rely on frequency standards or clocks, 
i.e. oscillators with known frequencies. At present, clocks that are used for time 
measurements predominantly operate in the radio frequency (rf) regime whereas 
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for length measurements optical clocks are used. In the context of general rel- 
ativity, the definitions of the second and the meter have to be considered as 
units of proper time and proper length, respectively. Depending on the required 
accuracy, it is often necessary to apply general relativity to metrology [4] . 

In this lecture. Sect. 2 concentrates on the basics of atomic clocks and re- 
views the status of rf clocks (Sect. 3) and optical clocks (Sect. 4). In Sect. 5 
an example of the required frequency links between both regimes is given that 
now allows to use rf clocks and optical clocks in the same way for length and 
time measurements. Sect. 6 concludes with optical frequency standards for the 
realization of the meter. 



2 Basics of Atomic Clocks 



2.1 Definitions 



The definition of the second [2] does not include a definite instruction how to 
build a clock. However, the statement “the Caesium-133 atom” makes some im- 
plicite assumptions. First, the transition frequency is assumed to be the same in 
all Caesium-133 atoms used in the clock. This requires that the clock is operated 
in such a way that the ideal conditions of atoms isolated and at rest are fulfilled. 
In reality, these ideal conditions can be realized only approximately and there 
will be frequency offsets due to spurious effects. The accuracy of a clock can be 
measured only if a more accurate clock is available. Otherwise, as it is the case for 
the best primary standards the magnitude of the uncertainty of the knowledge 
about these offsets is a measure of the accuracy (or uncertainty) one has to at- 
tribute to a real clock [5] . In this sense, the uncertainty represents the estimated 
magnitude of a possible deviation of its frequency from the unperturbed atomic 
frequency. Within a limited measurement time r the actual mean value of the 
frequency of a clock may differ by a larger magnitude as compared to the one 
given by the uncertainty resulting from various noise processes associated with 
the realization of the clock e.g. by the shot-noise of the atomic sample or noise 
in the electronics. A generally agreed measure of the frequency instability of a 
clock is given by the relative two-sample standard deviation, or Allan standard 
deviation CTj,(2,t) [6] 




Here, y/. is the average value of the normalized frequency difference y(t) in an 
arbitrary time interval tk to tk + t between the clock under investigation and a 
suitable reference. The normalized frequency is used rather than the frequency 
to allow the comparison of the performance of clocks operating at very different 
frequencies. For short times t the frequency stability is often limited by the 
counting statistics 



CTy(2, t) = const X 



Qx |x 




(2) 
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where Q = vjAv is the quality factor of the resonance depending on the 
linewidth Av and the center frequency v, SjN is the signal-to-noise ratio in 
a bandwidth of 1 Hz. The constant depends on the lineshape of the resonance 
line and is of the order unity. 

2.2 Perturbing Effects and Remedies 

The ideal reference in an atomic clock or frequency standard would therefore 
comprise of an absorber at rest in an environment free of perturbations by other 
particles or fields with a high line quality factor Q and a strong signal when 
probed by an external oscillator. The art of making a perfect clock depends on 
the ability to realize this ideal situation as close as possible. 



Limited Interaction Time The basic limitation to the quality factor Q is 
given by the frequency uncertainty Av = 1/T resulting from the time T the 
absorbers are interrogated. Consequently, one is interested first to use absorbers 
with narrow line-width transitions and second to allow for the required long 
interaction time of the external field with the quantum absorber. Such long- 
lived states are provided e.g. by the ground states of atoms and ions splitted due 
to the hyperfine interaction leading to frequency separations in the rf domain up 
to several 10 GHz (see table 1). Furthermore, atoms, ions or molecules sometimes 
have long-lived electronic states with optical transitions corresponding to several 
hundred terahertz (see e.g. Table 2). 

To utilize the potential of these lines one has to allow for an interaction time 
of milliseconds and above. Long interaction times can be achieved by several 
methods that find application in various frequency standards. There are long 
beam machines based on atomic or molecular beams of low velocities together 
with separated field excitation [7] (Ramsey excitation) as used in the the con- 
ventional Cs atomic clocks (Sect. 3.1). In the hydrogen maser the atoms are kept 
inside the interaction volume by containing them in a bulb. Laser cooled ballis- 
tical atoms allow easily for interaction times of a millisecond and this time can 
be increased to about a second in a so-called atomic fountain (Sect. 3.3). The 
longest interaction time can be achieved with ionized particles in a so-called ion 
trap in the harmonic potential well (see Sect. 3.4). 



Velocity of the Absorbers Even if an absorber with a very small natural 
line width has been chosen the center of the resonance line and its width can 
be seriously affected by the velocity of the absorber. The shift of the absorbed 
frequency resulting from the first-order Doppler effect Aviv‘s order) = k ■ u/27t 
can be as high a 10“® for thermal absorbers. This shift depends on the direc- 
tion of the wave vector k and on the direction of the velocity v of the ab- 
sorbers. Consequently, its influence can be largely avoided by a perpendicular 
alignment of both vectors as it is done in a beam apparatus. Other methods 
determine the influence of the first-order Doppler shift and subsequently cor- 
rect the corresponding frequency offset e.g. by re-directing one of the vectors 
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by a so-called beam reversal technique. These methods cannot be applied to 
the second-order Doppler shift resulting from the time dilation of the absorber 
which is order) = ■ w^/(2c^) when observed in the laboratory frame. 

For the aforementioned thermal velocities it can be as high as Avjv = 10“^^. 
This shift scales with the square of the velocity of the absorbers and can only be 
reduced by orders of magnitude if absorbers of low velocities are utilized. The 
use of slow absorbers also increases the interrogation time of the field with the 
atoms and therefore leads to a reduced transit-time broadening. 

With neutral atoms two basically different approaches are used to reduce 
the influence of the first-order Doppler effect. The first one selects a group of 
slow absorbers out of a broad velocity distribution e.g. by using transversal ex- 
citation of absorbers in collimated molecular and atomic beams or by selecting 
only the slow molecules out of the thermal distribution [8] leading to a reduc- 
tion of the relevant velocity spread by several orders of magnitude. The second 
approach reduces the width of the velocity distribution by cooling the ensemble. 
The positive result of this approach is that more absorbers are compressed into 
the desired velocity range and the signal used for frequency stabilization of the 
oscillator to be locked is increased. Laser cooling of atoms now represents a well 
established and acknowledged technique [9-11] to reach low velocities. Two-level 
atoms with a fast cycling transition can be cooled to the so-called Dopppler limit 
A;bT = h'y /2, where 7 is the line width of the cooling transition. As an example, 
the root -mean-square velocity corresponding to this temperature is 9 cm/s for 
Cs. Often, laser cooling is performed in a magneto-optical trap (MOT) [12]. In 
a MOT, photons from six orthogonal laser beams with frequencies below the 
atomic resonance are absorbed by the atoms with velocity components coun- 
terpropagating to the laser beams. The subsequent emission of photons from 
the spontaneous decay of these excited atoms is isotropically whereas the ab- 
sorbed photons always have the same momentum k of the laser beam. Hence, the 
atomic velocity after a large number of absorption/emission cycles is reduced. In 
the MOT, a magnetic held increasing into all spatial directions with the zero of 
the held located at the center of the six crossing beams shifts the energy levels 
of the atoms by the Zeeman effect. Atoms outside the center of the MOT expe- 
rience a stronger repelling force due to the bigger Zeeman shift than the ones 
close to the center. Consequently, the combined influence of the light held and 
the magnetic held leads to cooling and trapping of the atoms in a MOT. The 
achievable velocity can be significantly lower as given by the Doppler limit in 
the case of multi-level atoms, due to the mechanism of the so-called polariza- 
tion gradient cooling [10] leading to an rms velocity of 1.1 cm/s in Cs. Raman 
cooling methods can achieve even lower velocities [13]. Laser cooling works also 
very efficient for trapped ions [14], where specific cooling methods for the bound 
ions as e.g. resolved-optical-sideband cooling allow to cool a single ion close to 
the zero point energy [15]. 



Interaction with Fields and Particles The interaction of the absorbers with 
external fields perturbs the quantum states of the absorbers and consequently 
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Table 1. Ground-state splitting due to the hyperfine structure 



H 


1 420 405 751.770 Hz ± 0.003 Hz 


H-maser 


[5] 




6 834 682 610.904 29 Hz ± 0.000 09 Hz 


Rb-clock 


[19] 


i^^Cs 


9 192 631 770 Hz 


Cs-atomic clock 


[5] 


171 Yb+ 


12 642 812 118.466 ± 0.002 Hz 


Yb'*'-ion trap 


[20] 


199Hg+ 


40 507 347 996.841 59 ± 0.000 44 Hz 


Hg^-ion clock 


[21] 



leads to frequency splitting, shifts or broadenings that, even small, often cannot 
be tolerated when the highest accuracy is aimed at. Examples include Stark and 
Zeeman shifts resulting from static magnetic and electric fields but also from ac- 
Stark shifts as a result of trapping fields, stray light or thermal radiation [16]. 
The interaction with other particles of the same species [17] or of other species 
[18] may lead to collisional shifts. The degree to which these influences can be 
suppressed and corrected for is reflected in the achieved accuracy of a clock. 

3 Clocks Based on rf Transitions 

A number of excellent frequency stan- 
dards are based on the hyperfine sep- 
aration of the ground state of atoms 
and ions (see table 1). This splitting is 
due to the coupling between the mag- 
netic moment associated with the total 
spin J of the electron shell and the nu- 
clear spin I. In ^^^Cs e.g. the coupling 
of / = 7/2 h and J—\j2 h leads to the 
F = 7-kJ = 4andF = /- J = 3 
states which split in the magnetic held 
(Fig.l). The Cs clock uses the tran- 
sition with the smallest sensitivity to 
magnetic fields, i.e. between the F = A,mF = 0 and F = 3,mF = 0 states with 
a quadratic dependence of Af = 0.0427 x Hz/(|uT)^. 

3.1 Cs Beam Machine 

Most Cs atomic clocks operated today use Cs atoms in an atomic beam inside 
a high vacuum chamber. Their principle (Fig. 2) does not differ much from the 
ancestor of the Cs atomic clocks developed at the National Physical Labora- 
tory, England [22]. The Cs atoms effusing from an oven kept at a temperature 
of about 450 K have a most probable velocity of about 250 m/s. Due to the 
small energy separation between the F = 3 and F = A states both levels are in 
fact almost equally populated in the thermal beam. Detection of the transitions 




Fig. 1. Hyperfine structure of Caesium in 
a magnetic field 
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between these levels induced by an external oscillator can be performed with an 
optimal signal-to-noise ratio in a large sample of atoms only if most atoms are 
initially in the same state. It is therefore necessary either to prepare all atoms in 
the same state or to select atoms from one state. The latter procedure is applied 
in conventional Cs atomic beam machines in which a magnetic polarizer with 
an inhomogeneous magnetic field realized by a two-, four-, or six-pole magnet 
is used to deflect the atoms according to their magnetic moment. Only atoms 
in the state F = A enter the first interaction zone of the Ramsey cavity fed 
by the rf derived from a voltage controlled quartz crystal oscillator (VCXO). 
The interaction with the rf held transfers the atom into a coherent superposi- 
tion between the F = 4 and the F = 3 state. The temporal evolution of the 
quantum mechanical state of the Cs atom can be thought of as an oscillation 
with a frequency corresponding to the energy difference of these states. After the 
interaction of the atom with the held in the second zone of the Ramsey cavity 
the probability to And the atom in the F = 4 state or in the F=3 state depends 
on the external rf field being in phase or out of phase with the atomic oscillator 
at the second interaction zone. Hence, the number of atoms in either the F = 4 
or in the F = 3 state oscillates as a function of the frequency of the external 
oscillator thereby leading to the Ramsey interference structure [23] similar to 
the ones shown later (Fig. 4 and Fig. 5). In Fig. 2 the magnetic analyzer deflects 
the atoms in the F=3 state into the detector. The detector signal is used to 
stabilize the frequency of the VCXO to the atomic transition frequency. The 
largest contributions to the frequency uncertainties in these clocks results from 
cavity phase shifts due to the imperfect field distribution in the cavity and from 
inhomogeneities of the magnetic C-fleld. As a representative of the state-of-the 
art primary beam-machine clocks, PTB’s CSl has a stability of 5 x10“^^a/t/s 
and an accuracy of 7 xlO“^® [24]. 

3.2 Optically Pumped Cs Clocks 

The short-term stability of the Cs beam machines is limited by the shot noise of 
the number n of atoms in the beam since the signal-to-noise ratio (2) increases 
with ^/n. The stability can be increased if all atoms can be prepared in the 
desired state rather than selecting one particular state with only 1/16 of the 
total number of atoms leaving the oven (see Fig.l). Optical pumping techniques 
can be utilized to accumulate atoms in the desired state. Similarly, the detection 
of the excited atoms can be performed by laser induced optical fluorescence. 
Several Cs beam machines based on these principles have been designed and 
operated (see e.g. [25-28]). A stability of about 1 x10“^^-\/t/s [29,28] and an 
accuracy of 1 xlO”^"* has been reported [28] for these types of clocks. 

3.3 Atomic Fountain Clocks 

Laser cooling of Cs atoms can lead to very low temperatures of about 2 /iK 
corresponding to velocities of a few centimeters per second that might allow in- 
teraction times of seconds and more. The utilization of such a long interaction 
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time meets with difficulties, because gravitation accelerates the atoms and, in 
contrast to ions to be discussed in more detail in Sect. 3.4 and Sect. 4.1, up to 
now no means have been found to keep neutral atoms at rest in the gravitational 
field without seriously perturbing the resonance frequency. Applying Ramsey’s 
separated-field technique to ballistically free flying atoms leads to the develop- 
ment of a of a fountain clock in which interaction times of about a second can 
be realized. In an atomic fountain a cloud of cold atoms is launched vertically 
through an interaction region with a velocity of a few meters per second. The 
gravitational acceleration g forces the atoms to slow down and fall back thereby 
passing the same interaction zone a second time. Similarly, as in the Cs atomic 
clock with a thermal beam, the resulting resonance feature displays a Ramsey 
interference structure with a resolution that is determined by the time T be- 
tween the two interactions. This time is calculated from the time T' necessary 
for the atoms to climb up to the apogee and the same time to fall down 




For a realistic apparatus with a height between the resonator and the apogee of 
H= 1 m this time is T = 0.9 s and the necessary starting velocity of the atoms 
is n = \/2gH = 4.5 m/s. The low velocities of the atoms in the atomic cloud are 
prerequisites for an efficient operation of such an atomic fountain inasmuch as 
the number of atoms returning into the microwave cavity after the ballistic 



Polarizer Magnetic Analyzer 

magnet Shield magnet 




Fig. 2. Schematics of a Cs atomic clock. The field lines of the homogeneous magnetic 
C-field are perpendicular to the paper plane. The states Ei and E 2 correspond to all 
E = 3 states and the F = 4, mp = —4 state and to the F = 4, —3 < mp < 4-4 states, 
respectively (see Fig. 1). 
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flight depends on the transverse ve- 
locity. From a cloud of caesium 
atoms with a temperature T = 
2 about 10% of the atoms And 
their way through the 1 cm hole of 
the resonator in contrast to about 
0.2% that would enter the same re- 
gion if the sample was cooled only 
to the Doppler limit of 125 /rK 
[30]. The first successful predeces- 
sor of an fountain experiment us- 
ing laser cooled sodium atoms in 
a pulsed rf cavity demonstrated a 
line width of 2 Hz [31]. Following 
the realization of the first fountain 
clocks [32,33] it has been realized 
soon that collisions between the 
cold atoms may limit the achiev- 
able accuracy in a Cs fountain [17]. 
It is now known that the associated 
shifts are much less pronounced in 
rubidium [34,35]. Now a variety of 
atomic fountains using Cs or Rb atoms are operated or under investigation for 
atomic clocks in several institutions [36-41,34,35,42]. Even though these real- 
izations differ somewhat in their design, each apparatus uses three sections (see 
Fig. 3), a trapping and cooling section, an excitation section containing the mi- 
crowave cavity and the zone for the ballistical flight and a section for the detec- 
tion. The cloud of cold atoms is often prepared from a thermal vapour fed by a 
Cs reservoir at a base pressure of about typically 10“® Pa. In a MOT [12] about 
10^ Cs atoms are collected and further cooled in an optical molasses to about 
2 /rK and sometimes further cooled by means of Raman cooling techniques [43] . 

In the next step the atoms have to be launched upward without increasing 
their temperature. This is conveniently done in a moving molassis where the 
frequency v\ of the downward pointing vertical laser beam is red detuned by 5v 
and the frequency V 2 of the upward pointing beam is blue detuned by 5i> with 
respect to the original frequency v. The corresponding phase front e.g. the ones 
for 5ujt + kz = 2'K5vt + 2 t:zI\ = 0 move with the velocity v = zjt upwards. The 
atoms are now exposed to this “walking wave” with velocity 

V = \5v (4) 

and are instantly laser cooled into equilibrium with the walking wave and hence 
are moving up with the velocity given in (4). When the vertical beams are 
switched off the atoms follow the trajectories of a free ballistic flight. The long 
flight time between the two interactions leads to Ramsey spectra with high 
resolution (Figs. 4 and 5). 
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Fig. 4. Measured probability P to excite 
Cs atoms in the fountain clock of PTB with 
low resolution. 




Fig. 5. Excerpt of the high resolution 
Ramsey spectrum similar to Fig.4 with 
a height of the atomic fountain of about 
0.6 m. 



Recently, a stability of 5 xl0~^‘‘(r/s)~^^^ [44] and accuracy of 1-2 xl0~^® 
[45,36] has been reported for Cs fountain clocks with smaller uncertainties an- 
ticipated. 

3.4 Clocks Based ou Ion Traps 

To keep microscopic particles at rest at a fixed position in space a strong binding 
force pointing to this point is required, l^ue to the weak interaction of neutral 
atoms or molecules with electric and magnetic fields, strong fields or field gra- 
dients are required which may seriously perturb the atomic energy levels. For 
ionized particles, however, much smaller fields can be used for confinement in 
so-called ion traps. There are several advantages of ion traps when used for fre- 
quency standards which will be briefly addressed here. First, storage times of 
days and more allow to probe ultra-narrow lines with virtually no broadening 
due to limited interaction times. Second, the concentration of the absorbers into 
a small volume allows effective application of the methods of laser cooling and 
detection of the induced signals originating from a single spot. The reduction of 
the velocity and the confinement to regions smaller than the wavelength of the 
probing radiation (Dicke regime) allows the reduction of the Doppler effect in all 
orders. According to Dicke [46] the first-order Doppler effect of a single photon 
transition of wavelength A can be suppressed if the absorber is kept in a range 
smaller than A/2. Third, the use of ultra-high vacuum reduces perturbations 
from the coupling to the environment. Fourth, the strong interaction with other 
ions can be avoided by the use of a "mono ion oscillator as potential ultimate 
laser frequency standard” as has been proposed by Dehrnelt already in 1982 [47[. 
The use of ion traps in frequency standards has been the subject of a number of 
recent reviews (see e. g. [48-51]). 

In an ion trap an electric field E{r) can be used that leads to a net force 
on an ion with charge e and mass m pointing to the center of the trap at any 
space point inside the trapping volume. The corresponding potential 4>{x, y, z) 
has a parabolic shape and may be represented hy <P cc [ax- + hy- + cz"^). For the 
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constants determining this potential the condition a + 5 + c = 0 can be derived 
from the Laplace equation. This condition is e.g. fulfilled by the two particular 
solutions a=l,5=— l,c = 0 corresponding to a linear quadrupole trap and 
a = b = l,c = —2 representing the three-dimensional Paul trap [52]. 

The two-dimensional quadrupole potential can be generated by a set of four 
hyperbolic electrodes where the potentials of adjacent electrodes change polari- 
ties. This case can be approximated by four rods (see Fig. 6 b). In the case of a 
static field in the radial direction a particle of electric charge +e will be repelled 
from positively charged electrodes experiencing a repulsive force towards the 
center at r = 0. According to the linear dependence of the field strength E on 
the coordinate similar to Hooke’s law we expect the ion to perform a harmonic 
oscillation in this direction. Along the orthogonal direction, however, the ion is 
accelerated towards the nearest negative electrode. To confine the ion for both 
directions, in the rf or Paul trap [52] the potential of both pairs of electrodes is 
alternated periodically 



= ^ {x'^ -y^)cos{u;t) . (5) 

The alternating potential leads under certain conditions [53] to a time-averaged 
net force that attracts the ions towards the nodal line. In Fig. 6 b) two additional 
ring electrodes confine the ions in the axial direction. 

In a three dimensional Paul trap the potential has the form 



^{x,y,z,t) 



Udc + Vac cos (wt) 



■ 2^0 



/ 2 I 2 2\ 

(x +y - z ). 



(6) 



A dc voltage Udc is added in general to the alternating rf voltage I4c with a 
driving frequency v = (Xi/27t to vary the ratio between the restoring forces in 
axial and radial direction. The positive sign in (6) leads to a hyperbolic surface 
with rotational symmetry around the z— axis generated by a ring electrode of 
an inner radius rg (Fig. 6a). The negative sign results in two branches of the 
hyperbola of revolution separated by the distance 2zg = \[2rg also exhibiting 
rotational symmetry with respect to the z— axis. 




a) 






Fig. 6. Radio frequency ion traps a) Paul trap with hyperbolic end caps and a ring 
electrode b) linear trap. 
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Inside the trap the trajectory of the ion is a superposition of a driven motion 
at the frequency w referred to as the micromotion and a low-frequency motion 
called the secular motion. 

Ion traps of both kinds shown in Fig. 6 have been used with many species 
for frequency standards (for an overview see e.g. [50]). Using rf transitions, most 
work has probably been done using mercury and ytterbium ions. The 12.6 GHz 
clock transition in a cloud of about 50 000 ions in a hyperbolic Paul 

trap has been utilized [18,54]. The ions were cooled with a buffer gas and the 
frequency has been determined by extrapolation to a vanishing relativistic, colli- 
sional and Stark effect shift with a relative uncertainty of 7 xl0“^^. The largest 
contribution to the systematic uncertainty resulted from the temperature of the 
ions of about 1800 K. Fisk et al [20] used two linear Paul traps and demonstrated 
a relative uncertainty of 2 xl0“^^. The Allan standard deviation as a measure 
of the frequency instability was ay{2,r) = 4.7 x 10“^‘*(r/s)“^/^. 

The 40.5 GHz clock transition in ^®®Hg has been investigated and utilized 
as a frequency standard by several groups (see e.g. [21,55-57]). Devices with 
about 2 xlO® ions cooled with He buffer gas have been designed by Gutler 
et al [56] and the frequency was measured with a relative uncertainty of 2.5 
X 10“^^. A mercury linear ion trap standard operated at the JPL gave a stability 
of (Ty(2,r) = 6.5 X 10“^^(t/s)“^/^ [57]. Slightly lower stability of (7^(2, r) = 
3.3x 10“^^(r/s)“^/^ but excellent low relative uncertainty from systematic effects 
of only 3.4 parts in 10^® was reported for a few ions in a cryogenic linear trap 
of the National Institute of Standards and Technology [21]. 

4 Optical Clocks 

The major advantage of frequency standards operating in the optical rather than 
in the rf range is based on their high frequencies. With the same interaction 
time i.e. with the same resolved line width the quality factor Q can be higher by 
several orders of magnitude. The short wavelength of the radiation in the optical 
domain, however, makes it difficult to keep the absorbers in the Dicke regime. 

4.1 Clocks Based on Ion Traps 

Ion traps allow to confine a single atom in the Dicke regime. For highest accuracy 
the number of trapped ions has to be restricted to a single ion in a hyperbolic 
Paul trap or to a few ions in a linear Paul trap since the ions have to be confined 
at the field free center or nodal line of the traps. The sensitive detection of the 
transition of an absorber consisting of a single ion is achieved by the method 
of electron shelving [47] where a strong transition shares the ground state with 
the weak clock transition. Suppose, that the fluorescence radiation from the 
strong transition is monitored. The transition of the ion into the long lived state 
immediately is identified by the disappearance of the fluorescence radiation from 
the strong transition which persists as long as the electron is shelved in the long- 
lived state. This scheme allows signal detection with unity detection efficiency. 
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Fig. 7. a) Partial energy diagram of the Yb ion b) Spectrum of the transition of a 
single Yb ion in a Panl trap as measured by quantum jumps [66] 



There is a number of suitable candidate ions for optical frequency standards 
[58] including ®®Sr+ [59,60] at 674 nm, ^^®In+ [61-63] at 236.5 nm, [64] 

at 282 nm or Ytterbium has several transitions at 411 nm [65], 435 nm 

[66], or 467 nm [67] (see Fig. 7 a). The strong transition at 369 nm is used for 
laser cooling and detection. The line at 467 nm has a natural line width of 5 
xl0“^° Hz corresponding to Q = 10^"^ [67], but due to its low transition rate it 
has been detected but not utilized yet. The 435 nm transition has been recorded 
with a line width of 80 Hz [66] by detecting the quantum jumps (see Fig. 7 b) 
where the carrier together with the sidebands due to the oscillation frequencies 
of the ion in the trap in radial (ri, T 2 ) and vertical (z) direction can be identified. 
Similar line widths have been observed with the mercury ion and it is expected 
that optical frequency standards based on single ions soon will reach relative 
uncertainties at 10“^®. 

4.2 Optical Frequency Standards Based on Neutral Atoms 

From the proposed optical frequency standards based on neutral atoms (see e.g 
[68,69]) hydrogen [70], xenon [71,72] and the alkaline earth metals have received 
the most attention. The narrow intercombination line of the latter ones was 
utilized e.g. in Mg [73], Ca [74,75], or Sr [76]. Here the recent work of an optical 
frequency standard based on ^°Ca is reported. 

4.3 Expanding Cloud of Cold Ballistic Ca Atoms 

Ca atoms from the low velocity tail of the Boltzmann distribution {vprob — 600 
m/s, Toven — 900 K) are cooled and trapped in a MOT [12] (see Fig. 8). The 
Ca intercombination line at A = 657 nm is interrogated by the radiation from 
a high-resolution dye laser spectrometer [77] or a diode laser spectrometer [78]. 
The laser pulses for the excitation are generated by acousto-optic modulators 
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Fig. 8. Experimental setup for an optical Ca standard. 



(AO Ms) (Fig. 8) used as “light switches”. The radiation at A = 423 nm used 
to cool, deflect, and trap the Ca atoms in the MOT are produced either by a 
dye laser or a frequency doubled diode laser system. Up to 10^ atoms are loaded 
during about twenty milliseconds. To avoid systematic frequency shifts due to the 
Zeeman effect of the magnetic held of the MOT and light shifts due to the 423 nm 
radiation, both the magnetic held and the light of the trapping laser have to be 
shut off before interrogating the Ca atoms. The atomic cloud expands according 
to the root -mean-square (rms) velocity of the ballistic atoms of Vrms — 1 m/s. In 
order to achieve high spectral resolution combined with a good signal-to-noise 
ratio (S/N), the method of separated held excitation (optical Ramsey excitation) 
is applied. Short pulses of 1 /iS duration are used to excite a significant part of 
the cold ensemble of atoms. The corresponding time sequence consisted of atom 
trapping (ti = 15 ms), turning off the trapping fields (t 2 = 0.5 ms), separated- 
fleld excitation, and detection (t 4 = 0.5 ms). If the laser frequency is tuned close 
to the resonance, the fluorescence intensity contains a contribution which varies 
with the cosine of the laser detuning (^ — vca) times 27rt (Fig. 9). In contrast 
to rf Ramsey excitation, in the optical regime the atoms are not in the Dicke 




Fig. 9. Optical Ramsey resonances excited in an expanding cloud of Ca atoms released 
from a MOT . 
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regime. Hence, three [79] or four [80] interaction zones have to be used to excite 
Doppler-free optical Ramsey resonances. Horde has shown that the resonance 
structures can be interpreted as atom interferences generated by the excitation 
with separated fields [81]. The narrowest fringe width obtained with the Ca 
standard was below 400 Hz. PTB operates two different systems [74,82] allowing 
one to compare the transition frequencies of the intercombination line measured 
at two independent ensembles of Ca atoms and to check the reproducibility of the 
frequency delivered by such a Ca standard. Arelative uncertainty below 10“^^ 
was derived and it was shown that for an optimized system a relative uncertainty 
of about 10“^® [82] can be expected. A stability of 1.3 x 10 “^"‘(t/s)“^/^ was 
obtained for a similar system at NIST [75] . 

5 Measurement of Optical Frequencies 

A frequency stabilized laser only represents an optical frequency standard if its 
frequency is compared directly or indirectly with the frequency of the primary 
standard of time and frequency i.e. with the Cs atomic clock. This comparison 
meets with some difficulties due to the fact that both frequencies differ by al- 
most five orders of magnitudes. Therefore, up to now not many frequencies of 
optical frequency standards were measured. Some recent measurements include 
the optical Ca frequency standard [83,74], optical transitions in atomic hydrogen 
[84,85], the Sr+ ion standard [60], or the In+ ion standard [63]. As an example 
of a so-called frequency chain, PTB’s phase-coherent frequency chain from the 
Cs atomic clock to the optical regime is shown in Fig. 10. This chain is described 
in more detail elsewhere [83] and it suffices to recall that it basically applies 
the method of harmonic mixing thereby connecting the Ca standard via various 
intermediate oscillators (diode laser, colour center laser, CO 2 lasers, methanol 
laser, backward wave oscillator, Gunn oscillator, and H-maser) to the Cs clock. 
As an example, consider the uppermost stage of Fig. 10. The frequency of a diode 
laser is tuned near half of the frequency of the Ca stabilized laser. The radiation 
of the diode laser is frequency doubled in a non-linear crystal and the beat note 
between this radiation and the radiation of the Ca stabilized laser is monitored 
with a photo diode. If the signal of the photo diode is phase locked using a 
Phase Locked Loop (PLL) the frequency ratio of both lasers is kept constant 
and can be used to determine the frequency of the Ca stabilized laser, provided 
that the frequency of the diode laser and the frequency of the PLL are known. 
In the same way, the frequency ratios of all other stages can be related. Such a 
frequency chain allows a phase-coherent optical frequency measurement which, 
in principle, does not contribute to the uncertainty of the measured frequency 
value provided, cycle slips can be excluded in each multiplication step. 

Several frequency measurements during more than two years have been per- 
formed at PTB using two different MOTs, different resolutions, and different 
stabilization schemes. The weighted mean of all frequency measurements up to 
now is i^ca = 455 986 240 494.13 (12) kHz. This uncertainty makes the Ca sta- 
bilized laser one of the most accurate optical frequency standards in the visible. 
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Fig. 10. Frequency measurement chain of PTB. 



362 



F. Riehle 



Table 2. Radiations recommended by the CIPM for the realization of the meter [86] 



Atom or 
Molecule 


Transition 


Wavelength 

(pm) 


Rel. Standard 
Uncertainty 


iH 


15 - 25 


0.243 134 624 6260 


8.5 xl0~^® 


12712 


32-0 R(56), aio 


0.532 245 03614 


7.0 xl0”“ 


127l2 


11 - 5, R(127), ai 3 or (i) 


0.632 991 398 22 


2.5 xl0~“ 


«Ca 


^P\ — ^TYij 0 


0.657 459 439 291 7 


6.0 xlO”^® 


88gr+ 


5^5i/2 - 4^ 7 ) 5/2 


0.674 025 590 95 


1.3 xl0”^° 


CH 4 


7 - 6, 1 / 3 , P(7), 


3.392 231 397 327 


3.0 xl0“^^ 




year ► 

Fig. 11. Progress in the accuracy of the re- 
alization of the length unit during the past 
in the International System of Units (SI) 
according to the definitions of the CGPM 
[89,90] and the recommendations of the 
CIPM [91,92,86]. 



today. Consequently, this standard has 
been recommended recently by the 
CIPM [86] for the realization of the 
meter with the lowest uncertainty. The 
instability of a frequency measurement 
including the stabilities of the opti- 
cal Ca standard and of the hydro- 
gen maser was shown to decrease with 
\/t for integration times between r 
= 1 s and r = 1000 s. It reaches its 
flicker floor at approximately 10“^^. 
Frequency measurement chains used 
so far like the one of Fig. 10 are ex- 
pected to be replaced by chains that 
use less oscillators. The progress of 
new chains based on optical frequency 
interval divider chains [84] , optical fre- 
quency comb generators [87] or mode- 
locked lasers [88] is expected to allow 
the measurement of optical frequen- 
cies with less effort and may eventu- 
ally lead to optical clocks competitive 
or superior to the best rf clocks. 



6 Optical Frequency Standards for the Realization 
of the Meter 

Optical frequency standards generate optical reference frequencies which are 
required in precision laser spectroscopy, optical communication and in length 
and frequency metrology. From the optical frequency standards developed at 
various frequencies in many laboratories the International Committee of Weights 
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and Measures (CIPM) has evaluated and selected several radiations of such 
stabilized lasers and recommends them as references for the realization of the 
meter and for precision spectroscopy (see Table 2, [91,92,86]). Their wavelengths 
presently range from the near Ultra-Violet (UV) (243 nm) to the Infra-Red (IR) 
(10. 3 pm). The advantage of frequency standards for the realization of the meter 
is visualized from Fig.ll. The pace of the development of optical frequency 
standards and clocks with reduced uncertainties does not seem to slow down 
as can be concluded from the fact that the uncertainties of the optical Sr [60] 
and the hydrogen 1S-2S frequency or of the Ca stabilized laser [74] already are 
considerably lower than given in Table 2. 
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Abstract. SpaceTime is a mission concept developed to test the Equivalence Prin- 
ciple. The mission is based on a clock experiment that will search for a violation of 
the Equivalence Principle through the observation of a variation of the fine struc- 
ture constant, a. A spatio-temporal variation of a is expected in some string theories 
aimed at unifying gravity with other forces in nature. SpaceTime uses a special tri- 
clock instrument on a spacecraft which approaches the snn to within four solar radii. 
The instrument consists of three trapped ion clocks based on mercury, cadmium, and 
ytterbium ions, in the same environment. This configuration allows for a differential 
measurement of the frequency of the clocks and the cancellation of perturbations com- 
mon to the three. The observation of any frequency drift between each of the clocks, as 
the tri-clock instrument approaches the sun, signals the existence of a scalar partner 
to the tensor gravity. Some relevant details of the mission design are discussed in the 
paper. 

1 Introduction 

The unification of gravity with other forces of nature is arguably the most ur- 
gent problem in theoretical physics. Yet the unification program, initiated by 
Albert Einstein shortly after his introduction of general relativity, has proved to 
be a difficult challenge and remains an open problem today. The lack of a clear 
path to unification is even more frustrating since the two underlying theories, 
the quantum field theory and general relativity, have been separately immensely 
successful. Quantum field theory is widely regarded as the most successful the- 
ory in physics, capable of reproducing details of interaction for all matter. The 
predictions of this theory have been upheld by the experimental scrutiny in nu- 
merous tests. General relativity, on the other hand, has widened our picture of 
reality, and helped us consider the birth of the Universe, the cosmological evo- 
lution, and exotic objects such as the black hole. In the past eighty years since 
its introduction, this theory has also withstood the most exacting scrutiny that 
experimental physics has been able to devise [1]. Experimental tests and obser- 
vations performed in vastly different conditions, ranging from the weak gravity 
of solar system, to the strong fields of binary pulsars, have all failed to reveal 
any violation of general relativity. In fact the failure of these very elaborate ex- 
periments with their impressive sensitivity in finding any violation has branded 
them with the label “Null Experiments” , prompting the cynics to declare these 
tests as measuring zero with ever higher accuracy! 
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Yet the impetus behind devising other, more sensitive, tests of general rela- 
tivity is the unification, itself. Amongst promising candidates for the unification 
program, string theories stand out since they evidently have the ingredients nec- 
essary to accomplish the task. All viable string theories, nevertheless, require 
a massless scalar field partner to the tensor gravity, in contradiction to Ein- 
stein’s Equivalence Principle (EEP), the pillar on which general relativity and 
other metric theories of gravity stand. This reality has led to a widespread belief 
among theorists that EEP must be violated, and experiments with requisite sen- 
sitivity can detect such violation. Thus the experimental tests of EEP now hold 
the promise of uncovering the range of validity of general relativity. And since 
some string theories make specific predictions regarding the nature and strength 
of the EEP violating scalar fields [2], sensitive tests of the EEP also stand to 
identify viable string theories. 

That a violation of EEP will signal evidence for new physics extends beyond 
the promise of the unified fields; it directly confronts some of the most pressing 
questions in cosmology. There have been a number of recent suggestions that 
the so called “cosmological constant problem” may be linked to the presence 
of a scalar (quintessence) field which could be manifested by a violation of the 
Equivalence Principle [3]. 

Among various approaches to test general relativity, atomic clock tests are 
particularly significant since they represent a direct test of the coupling of gravity 
to the electromagnetic field. In a recent paper [4] we have extended the ability 
of atomic clocks in testing general relativity to a local test of the variation of the 
fine structure constant, a. A variation of the fine structure constant will signal a 
violation of the EEP [1]. On the other hand, a spatiotemporal variation of a, the 
coupling constant of the electromagnetic field, on the Hubble time scale is also 
implied by the dilaton and other scalar fields (moduli), which are the necessary 
partners of the tensor gravity field in string theories [5] . So a clock comparison 
test can fulfill the promise of aiding the unification program by pointing to the 
viability, or not, of string theories predicting a variation of fundamental coupling 
constants. 

A clock test near the sun is particularly suitable to test the variation of a, 
since it can do so with a sensitivity far beyond that achievable on or near Earth. 
Finally, clock tests are also useful as a needed complement to “free fall” tests 
of general relativity. In the latter type of tests, the specifics of the coupling of 
gravity to any, or various, matter fields must be inferred from apportioning any 
observed difference in free fall of two, or more, test masses to various types of 
(mass) energy. Since the coupling of the dilaton field to the Coulomb energy 
dominates, when combined with clock tests (which are strictly based on atomic 
transitions driven only by the electromagnetic field) free fall tests will offer a 
vastly clearer picture of the details of the coupling of gravity to matter fields. 

In the following section we will motivate a mission concept to fly a special 
tri-clock to within four solar radii of the sun in search for the observation of a 
varying a. This mission is referred to as SpaceTime. We will describe the tri- 
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clock instrument in section 3, and the mission design in section 4. The conclusions 
will be given in section 5. 

2 Motivation 

Atomic clocks are based on electronic transitions in neutral or ionized atoms 
which result in the emission or absorption of a photon. The frequency of the 
photon is fixed by the difference in the energy levels that the photon connects, 
and the inverse of the frequency determines a time interval. This simple relation 
provides the means for a sensitive test of any variability in the fine structure 
constant, owing to the electromagnetic nature of the energy in the electronic 
structure of the atom. 

A change in a with time can be detected by comparing the frequency of 
two clocks based on dissimilar atoms, and through measuring the frequency 
drift between them. In the case of clocks based on hyperfine transitions, the 
a dependence of the hyperfine energy in each atom differs, depending on the 
atomic number, Z . This arises from the presence of a relativistic term in the 
hyperfine energy splitting, the so called Casimir correction factor F{Za), which 
accounts for the interaction between the magnetic field of the nucleus and that 
of the valence electron in the finite nuclear volume. It has been shown [4] that a 
time variation of a may be related to the variation in the transition frequency 
of atoms 1 and 2, on which two clocks are based, via, 

I In ^ = [LaF{aZ,) - L,F{aZ,)]~ . (1) 

at A 2 a at 

Here Ai {i = 1,2) is related to the hyperfine transition frequency, and LdF{Zi) 
represents the sensitivity of the atom with a nuclear charge Zi to the time 
variation of a. 

The difference in the sensitivity of various hyperfine clocks to the time varia- 
tion of the fine structure constant naturally points to experiments where clocks 
based on different atoms are compared for a time, and their relative frequency 
drift is measured to obtain a. Such a test was previously performed by compar- 
ing a mercury ion clock and a hydrogen maser for a period of about six months; 
it provided an upper limit for a/a to be 3.7 x 10“^'^/yr. 

There are two ways to improve the current limit on a variation set with clocks: 
Improved clock stability or longer measurement intervals for clock comparison. 
The first approach is possible, but based on the best available clock technologies, 
is limited to about one order of magnitude improvement. The second approach is 
difficult to implement, since the results improve with the observation time only 
linearly. Furthermore, the longer the observation time with separate clocks, the 
harder it is to exclude any environmentally caused frequency drift from spoiling 
the data. 

An equivalent approach to testing for a time varying fine structure ’’constant” 
is a search for a spatial variation of a in a strong gravitational potential. The 
presence of any new long range force based on a scalar field will result in an 
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additional frequency shift in the clocks, the value of which will depend on the 
nuclear charge of the atoms of each clock. Since 98 % of the mass of the solar 
system resides in the sun, comparison of the frequency of clocks near the sun 
provides for the highest achievable sensitivity for such a test. At four solar radii, 
the sun’s gravitational potential (divided by c^, where c is the speed of light) is 
5x 10“^, so clocks accurate to about 10“^® will enable measurement of differential 
frequency shifts at the level of of general relativity’s universal prediction. 

This notion is the basis for the proposed SpaceTime mission, which attempts 
to combine the opportunity of the increased sensitivity at the proximity of Sun 
with a unique “tri-clock”, to provide for an unambiguous test of a variation. 
The sensitivity of such an experiment may be compared with any test of time 
variation in a in the following manner. The variation of a with the variation of 
the gravitational potential may be written as: 



Sa 6U 

^ ^ 5 

a 



( 2 ) 



where S is the sensitivity of the experiment in the scenario suggested 

above) and U is the Newtonian potential. Taking the time derivative of the 
above equation leads to: 



a 



(3) 



The time variation of U, for a closed universe, is related to the Hubble constant 
H hy if /c^ ~ H. Thus the equivalent sensitivity of SpaceTime to the detection 
of any time variation in a is: 

- = SxH~ 10"2°/year (4) 

a 



for H less than 10“^°. 

This sensitivity should be contrasted to the limit of a/ a < 10“^® placed by 
comparing the ratio of isotopes 147 and 149 of Sumarium in the natural uranium 
fission reactor in the Oklo mine in Gabon [6] . It also exceeds the recent results of 
the observational astronomy with the tantalizing hint of a possible a variation, 
[7] by four orders of magnitude. 



3 The Tri— clock Instrument 

The tri-clock instrument design is key to the sensitivity of SpaceTime to an 
a variation, and is based on the trapped ion frequency standards of the JPL. 
The interest in ion traps as the basis for atomic clocks stems from the general 
requirements for achieving high performance. The achievable stability of atomic 
clocks depends on the energy level structure of the active atom, as well as the 
environment in which the atom undergoes the clock transition. The first condi- 
tion determines the achievable stability inherent to the atom via the sensitivity 
of the atomic transition to the environmental perturbations. The second deter- 
mines the degree to which the environmental perturbations can be reduced with 
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respect to the atomic energy level sensitivity. The trapped ion frequency stan- 
dard fulfills both these conditions, since ions with inherently high insensitivity to 
environmental changes are readily confined in traps which provide an essentially 
benign environment. The relatively large hyperfine level separation of ions, com- 
pared to neutral atoms, make them inherently less sensitive to environmental 
perturbations, such as the ambient magnetic fields 

Research in the past two decades in the field of trapped ion clocks has already 
produced impressive results with the development of a standard with stability 
corresponding to 7 x [8], and has demonstrated the highest measured 

stability for averaging intervals longer than 10® s. A brief description of the 
operation of the trapped ion frequency standard is given below, for completeness. 
The interested reader is referred to the literature [9-11] for more information. 

In trapped ion clocks, ions of the atomic species of interest are confined in 
an electrodynamic trap. The trap may be constructed of electrodes of various 
geometry. In the most widely used trap type, four rods arranged in a cylindrical 
geometry form the electrodes [9]. A radio frequency (rf) voltage applied to the 
trap keeps the neighboring electrodes at opposite potentials at any instant of 
time. Ions are created in the trap by the collision of electrons generated from 
a hot filament on a background vapor of the atom of interest. Once generated 
within the trap, ions experience the quadrupolar rf potential created by the 
electrodes. This quadrupolar potential is zero along the axis of the trap, and 
increases as ions get closer to the electrodes. Ions experience a force resulting 
from the interaction with the field which varies in direction as the field oscillates. 
If the trap rf frequency is high enough, it results in a net force directed towards 
the trap axis (the zero field position) at all times, and ions are thus trapped. 

Once trapped, the population of the two hyperfine levels of the ion are nearly 
the same at room temperature, as required by the Boltzman distribution. This 
implies that ions must be prepared in the desired state, from which the clock 
transition takes place. The state preparation is accomplished by optical pump- 
ing, whereby ground state ions are first pumped from one of the ground state 
hyperfine sublevels (typically the highest lying in energy) to the first electronic 
excited state, with light from a resonance lamp or a laser. Upon their decay, 
excited atoms make a transition to all hyperfine sublevels of the ground state. 
Since the lifetime of the excited state is short (only a few nanoseconds), the 
applied light depletes the upper hyperfine sublevel, while increasing the popu- 
lation of the lower sublevels. When the optical pumping is complete, and the 
upper hyperfine sublevel is completely empty, the atom is prepared for the clock 
transition which can be induced from one of the lower sublevels with population, 
to an empty sublevel. 

The next step in the operation of the clock entails inducing an interaction be- 
tween the atoms, and the microwave radiation from a local oscillator (LO) source 
at the clock frequency. It is this cycle in the clock operation which requires that 
the ions be shielded from any interaction with the environment. Since hyperfine 
transitions are due to magnetic dipole interactions between the electron and the 
applied field, shielding from external magnetic fields is of paramount importance. 
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For the same reason, it is practically more advantageous to apply a small, yet 
fixed, magnetic field to the trap, while shielding it from the outside magnetic 
field variations. 

If the frequency of the LO exactly matches the clock transition, the depleted 
upper hyperfine sublevel becomes populated again, and can be used to scatter 
photons from the resonance lamp again. Otherwise, the clock transition does not 
populate the upper sublevel, and the atoms remain transparent to the light from 
the resonance lamp. This information (scattered photons from the ion) is used in 
a feedback loop that controls the frequency of the LO to ensure its coincidence 
with the clock transition in the ion. The output of the LO, disciplined to the 
atomic transition, constitutes the clock output. 

There is one more important point about the LO which should be considered 
here. While the output of the LO is disciplined to the atomic transition, the LO is 
free running (i.e. without feedback control) for the time intervals corresponding 
to ion generation, optical pumping, clock transition, and clock interrogation 
cycles. Most of these cycles are short, on the order of a second long, but the 
microwave transition must be long, since the inverse of its length determines the 
width of the atomic line. In high performance clocks, the atomic line is as narrow 
as practical since it directly determines the achievable stability. This length of 
the microwave transition cycle typically ranges from about ten seconds to as 
long as a minute. A major obstacle in the realization of ultra-high performance 
atomic clocks, as required in precision measurements, is the availability of a LO 
whose noise during the free running interval is low. It has been shown [12] that 
the noise of the LO during the free running cycle can limit the long term stability 
of the atomic clocks based on cyclic feedback. 

For the SpaceTime mission we have modified the scheme outlined above to 
achieve the best performance. Here by “best performance” we mean one that can 
satisfy the following requirements: 1) a reliable, power efficient, compact, and 
low cost instrument, compatible with the space environment; 2) the requisite 
stability of a part in 10“^® in about 70000 s; and 3) the capability to produce 
unambiguous results. 

At the core of our design is a modified architecture of the ion trap, to include 
two segments (Fig.l) [13]. 

In this configuration, the generation of ions, their optical pumping, and the 
final interrogation for the clock transition takes place in the first trap segment. 
The interaction of ions with the applied microwave signal from the LO occurs 
in the second segment. Ions, being charged particles, can be gently “shuttled” 
back an forth between the two segments by applying a small dc potential for to 
the “end cap” separating the two segments. Thus only the second trap segment, 
where ions interact with the microwave signal, needs to be isolated from the 
environmental perturbations. The major advantage of this architecture is that 
the requirement of the environmental shielding is significantly reduced, leading to 
considerable reduction in size and mass. This ability is crucial to the realization 
of any spaceborne clock. 
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Fig. 1. Line drawing of the Shuttle Trap. 



The second important modification is a geometry whereby three of these 
traps share the same vacuum chamber, applied potentials, and magnetic and 
thermal environments, as shown in Fig. 2. 

Ions of three different atomic species are confined in different traps, forming 
the basis for three separate clocks. The cycles of ion generation, optical pumping, 
clock transition, and interrogation are common to all ions in the three traps. Each 
trap confining potential, though, is set to trap only one of the three species. The 
three potentials are nevertheless derived from the same source, and reduced 
to the required value. The hyperfine transition in each of the ion species is 
driven at the same cycle. While the transition frequency for each of the three 
species is different, they are derived from a single local oscillator. Three different 
resonant lamps optically pump, and later interrogate each of the ion species. This 
arrangement implies that most environmental perturbations will be common to 
ions confined in any of the three traps. Thus environmentally induced drifts can 
be caused to be cancelled. This is another crucial aspect of the SpaceTime clock 
experiment. Comparison of three independent clocks would leave the task of 
unraveling the influence of the environment’s perturbations on each clock, from 
a “true” signal to modeling with associated uncertainty. 



376 L. Maleki and J. Prestage 



WligMIrandadoigaLamfaviiligMgui^n 




Fig. 2. Layout of the Tri-clock instrument. 



For the SpaceTime mission, the specific design of the tri-clock calls for us- 
ing mercury, cadmium, and ytterbium ions in the three traps. The hyperfine 
energy levels of each ions, at 40.5, 14, and 12.5 GHz are shown in Fig. 2. Each of 
these ions has been previously used in the realization of an atomic clock [11]. In 
the tri-clock instrument the optical pumping and the clock interrogation cycles 
use resonant lamps. By manufacturing lamps which include all three species of 
appropriate isotope, we plan to realize redundancy, another important consider- 
ation in designing space instruments. 

We plan to use the redundancy in the three clocks to measure, and further 
eliminate, any environmental perturbations with different signature on different 
ion species. In particular, the value of the any ambient magnetic field may be 
obtained by using one of the three ions as the “probe” to measure the field, 
which can then be used to cancel out its effect on the remaining two clocks. 

A major advantage realized with the tri-clock architecture is the use of a 
single local oscillator, from which all three hyperfine signals can be derived. This 
approach will ensure that the noise of the local oscillator during its free running 
interval is a common noise to all three clocks, and thus can be cancelled out to 
a great extent. The extent of the cancellation is in the ratio of the frequencies, 
with the largest noise on the 40.5 GHz (mercury transition), and the smallest 
for the 12.5 GHz (ytterbium transition.) This cancellation is large enough that a 
high stability quartz oscillator would suffice as an LO to achieve 10“^® stability 
at 7000 s, as required for SpaceTime. 

The tri-clock architecture will thus allow the major sources of noise and 
perturbation to become common to all three clocks. This enables the cancellation 
of all the common noise when the three clocks are inter-compared, to a level that 
will allow the requisite sensitivity for finding a varying a. This feature, together 
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with the specific dependence of a varying a on the nuclear charge (Z) of each 
ion species, will produce an unambiguous signal as the three clocks are inter- 
compared. Finally, the SpaceTime experiment based on a differential clock shift 
measurement with the tri-clock instrument eliminates the need for comparing a 
spaceborne clock with a ground based clock. This latter approach would greatly 
complicate the mission by need for a stable, high data rate communication link, 
and associated need for the modeling of the media through which it propagates. 
The task of modeling alone, owing to the proximity to Sun, is difficult and subject 
to uncertainties. In short, the design of the tri-clock instrument provides for 
simplification of the mission, and reduction of weight, all of which significantly 
help the cost of the experiment. These benefits of the tri-clock complement 
the expected high performance and ensure that its signal are unambiguously 
interpreted. 



4 Mission Design 

The SpaceTime is designed to fit the NASA Mid-sized Explorer mission type 
(MIDEX). This mission type has specific limits set for the total cost, and the 
schedule. The goal of the SpaceTime mission design is to deliver the clock payload 
to the close vicinity of the Sun while minimizing complexity and risk. SpaceTime 
launches from Cape Canaveral Air Station (CCAS) on a Delta 7925 launch 
vehicle. This proven vehicle simplifies all aspects of the mission and allows us 
to reduce risk and cost, and return breakthrough science in the shortest time 
possible. After the first two stages burn, the third stage spins up to 60 rpm and 
injects the spacecraft towards Jupiter. The target at Jupiter is to within 8.68 
Jupiter radii, which will lead to a gravity assist to reduce the spacecraft energy, 
allowing it to fall toward the Sun with a 4 solar radii pass. The SpaceTime 
mission plans for two trajectory correction maneuvers (TCMs) to achieve its 
trajectory. The first maneuver will occur approximately 10 days after launch, 
and the second about one year after launch. Since SpaceTime differential redshift 
measurements allow a 10% error in the solar flyby radius, we target to 4.2 Rs ± 
0.2 Rg. This allows tolerable large navigation uncertainties at the Jupiter flyby. 

After the Jupiter encounter, the spacecraft falls rapidly toward the Sun, and 
will take only about 9 hours to travel from 10 Rg to A Rg. This trajectory greatly 
simplified navigation, and the tracking scheme allows for a Doppler track once 
every 1-3 months. This in turn simplifies mission operations and reduces cost at 
no increase to mission risk. 



4.1 Spacecraft 

The SpaceTime spacecraft design is simple and reliable. This design leads to 
reduced cost and risk in mission operations and allows the science data return 
at its critical point near the Sun. The SpaceTime spacecraft utilizes extensive 
heritage from past spacecraft, including Mars Pathfinder, Cassini, DS-1, Lewis, 
MSP ’98, and Defense Systems programs. Comprehensive analysis has produced 
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the best small spacecraft design for the SpaceTime mission within the budget 
constraints. The primary purpose of the spacecraft is to take the SpaceTime 
clocks as close as possible to the Sun and to allow the acquisition and trans- 
mission of data from the clocks as the spacecraft nears the Sun. Figs. 3 and 4 
show a spacecraft close-up and a near-solar pass configuration. The spacecraft 
integrates a blowdown mono-propellant propulsion system and the necessary 
spacecraft electronics to support the mission. The spacecraft electronics and the 
clock are all within a single thermal enclosure. 

The large, conical solar shield protects the 
entire spacecraft during the 4 solar radii en- 
counter. The thermal control system of the 
SpaceTime spacecraft will utilize this ther- 
mal shield system that will isolate its flight 
elements from the solar flux of a 4 (from 
the solar center) close passage. This shield 
system is based on a new technology and 
consists of a solar blocking element (primary 
shield), IR shields. High Temperature MLI, 
and support structure. The system is a sim- 
pler version of the Solar Probe shield system. 
The first element of this shield will operate 
at about 2000 K during close solar approach, 
but the shield system will provide thermal 
isolation that will allow the spacecraft ele- 
ments behind it to operate at nominal tem- 
peratures (0° to -1-40° C). 

The spacecraft thermal control system components behind the shield system 
utilizes flight-proven elements such as thermal conduction isolation, thermal 
surfaces (paints, films), MLI, louvers, and electric heaters/thermostats. This 
design will control spacecraft element temperatures for all flight regimes from 
launch to Jupiter flyby, and close solar approach. 

The SpaceTime spacecraft current best estimate dry mass is 114 kg, including 
the 20-kg science payload, with contingencies of 30% in both mass and power. 
The fuel load is about 30 kg for a launch mass of around 185 kg, the maximum 
allowable mass that can be launched on a Delta 7925 with Star 30c kick stage to 
a C3 of 120km^/sec^. The attitude control subsystem (ACS) subsystem provides 
attitude knowledge and control, and solar array articulation and control. The 
Medium Gain Antenna (MCA) is body-mounted, perpendicular to the space- 
craft centerline. Twin star trackers and an Inertial Measurement Unit (IMU) 
provide primary attitude determination, with four analog Sun sensors serving 
in a functional redundancy role as backup pointing devices. During the Jovian 
gravity assist, the ACS system will operate in an all-stellar mode so that power 
can be conserved by turning off the IMU. 





SpaceTime Mission 379 



^ , r-»c-vr /EARTH «t PERIHCUON 

..X nancl to L)oIji /(aiouadfatute) 

^ * JUPITER GRAVITY 

ASSIST FLYBY 




Fig. 4. picture trajectory 



The Command and Data Handling (C&DH) subsystem provides instrument 
science data storage, spacecraft commanding, interfaces to the X~Band transpon- 
der, ACS sensors and actuators, and spacecraft timing. 

The communications subsystem supports X-band command uplink and data 
downlink using the Small Deep Space Transponder, a 3-watt Solid State Power 
Amplifier, one 0.6-meter diameter MCA, and dual low-gain patch antenna (LGA) 
The MCA provides 100 bps data transmission near Jupiter and 100 bps during 
the solar flyby. The LGAs provide emergency command capability. 

The power subsystem provides power collection, storage, control, and distri- 
bution. The Electrical Power Subsystem (EPS) is comprised of the solar array, 
battery, charge control unit (CCU), power distribution and drive unit (PDDU), 
and pyro initiator unit (PIU). Because of the profile of this mission, the Space- 
Time spacecraft carries two sets of featherable solar arrays, a small secondary 
battery and a high-energy primary battery for the critical final phase of the 
mission when the solar arrays can no longer operate. The low temperature solar 
panels are populated with 6.7 m^ of silicon solar cells, providing 1500 W aver- 
age power near Earth. The low temperature solar arrays are sized for stay-alive 
power near Jupiter. The low temperature array is jettisoned after crossing Mer- 
cury’s orbit at 0.3 AU. The high temperature portion of the solar array consists 
of 0.15 m2 of Gallium Indium Phosphide cells at 6% efficiency and provides 
power after crossing 0.3 AU. These arrays will provide up to 200W of power 
as close as 0.1 to 0.15 AU when feathered to a 70-degree angle of incidence. 
This will allow the SpaceTime clocks to be calibrated prior to the final phase of 
the experiment. The high temperature arrays will also be jettisoned to prevent 
an imbalance in control forces near the Sun. The High-Energy Density (HED) 
primary battery, a 300 A-hour Lithium Garbon Fluoride (LiGF) package with a 
long shelf life, provides the 3730 W-hr necessary during the 33-hour solar flyby 
after the solar arrays are jettisoned at about 0.13 AU. 

5 Conclusions 

In this article we have described a new mission design for a sensitive clock test 
of Einstein’s Equivalence Principle. The mission includes a payload consisting of 
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a unique tri-clock instrument, based on cadmium, ytterbium, and mercury ions. 
The instrument will fly to within four solar radii of the Sun, where a differential 
drift between the three clocks will signal a violation of the Equivalence Principle. 
The instrument is designed to eliminate or minimize all ambient noise sources, 
and with the designed stability of 1 x 10^®, enables an unambiguous test at a part 
in 10^® level. This same measurement, interpreted as a search for a variation of 
the fine structure constant alpha, will test the validity of string based theories 
that specify a scalar held partner, such as the dilaton, to the tensor gravity. 
An important feature of this mission is that it is based on proven, existing 
technology, and can be flown within the cost and schedule constraints of NASA’s 
MID EX (Medium-sized Explorer) program. 
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Abstract. 25 years ago, in summer 1974, Joseph Taylor and Russell Hulse discovered 
the first binary pulsar, a pulsar in orbit with a compact companion which itself is 
most likely a neutron star. This pulsar, denoted PSR B1913+16, turned out to be the 
most exciting laboratory for testing relativistic gravity theories. Before the discovery of 
PSR B1913+16 all gravity experiments were conhned to our solar system with its very 
weak gravitational fields. Hence, it has been possible to test gravity theories only in 
the hrst post-Newtonian approximation. Binary pulsars take us beyond the weak-field 
context because of their high orbital velocity and/or the strong self-gravitational fields 
of neutron stars. 

To date, more than 70 binary pulsars have been discovered, most of them in orbit 
with a neutron star or a white dwarf. Many binary pulsars belong to a group of so-called 
millisecond pulsars which have very short rotational periods (< 20 ms) and slowdown 
rates of typically 10~^°, proving to be extremely accurate clocks. This gave rise to a 
variety of new gravity experiments, like tests for the strong equivalence principle. 

After a brief introduction to pulsars, the technical and theoretical aspects of binary- 
pulsar gravity experiments are reviewed. The latest results are presented and an outlook 
is given to future improvements of these experiments. 



1 The Pulsar Population 

Pulsars were discovered in 1967 by Antony Hewish and Jocelyn Bell at Cam- 
bridge University. During a study of rapid fluctuations of extragalactic radio 
sources caused by the interstellar medium they came across point like sources 
which were emitting radio signals with extraordinary regularity [18]. Very soon 
after their discovery, pulsars were understood to be highly magnetized rapidly 
rotating neutron stars in our Galaxy, becoming apparent as ‘cosmic lighthouses’ 
with excellent rotational stability. Over the past 30 years, systematic surveys 
with the world’s largest radio telescopes have revealed more than 1200 pulsars. 
About 500 of them were found only recently in the on-going Parkes-multibeam 
Galactic-plane survey [9] which proved to be extraordinarily successful and lead 
to the discovery of a number of exciting single and binary pulsars. To date, ro- 
tational periods of pulsars, P, span nearly 4 orders of magnitude ranging from 
1.56 ms to 8.51 s. The observed period changes, P, explained by the loss of rota- 
tional energy due to magnetic dipole radiation and/or relativistic particle winds, 
range from 10“^^ to a few times 10“^^, corresponding to magnetic surface fields 
of about 10® to 10^"^ Gauss, respectively. The majority of pulsars are clustered 
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around a P of one second and a P of 10“^^ (see Fig. 1). It is believed that these 
pulsars were born in supernova explosions with rotational periods of a few tens 
of milliseconds and then spun down into the region of ‘normal’ pulsars within 
10® to 10^ years. 

There is a group of pulsars whose characteristic age (= P/2P) typically 
exceeds 10® years and whose magnetic surface fields lie below 10^® Gauss. Most 
of them are found to be members of binary systems and are located to the 
bottom left of the large group of ‘normal’ pulsars in Fig. 1. It is understood that 
these pulsars were born as ‘normal’ pulsars in binary systems. Then, after 10® 
to 10® years the rotation became too slow and the pulsar-emission mechanism 
ceased to work. In the meantime the companion star evolved, started to fill 
its Roche lobe and transfer matter onto the neutron-star. The infall of matter 
spun up the neutron star and suppressed the magnetic field by several orders 
of magnitude, leaving a fast rotating ‘recycled’ pulsar with a weak magnetic 
field behind. Depending on the initial mass, the companion star will either form 
a second neutron star in a supernova explosion and create, in cases where the 
supernova explosion does not unbind the binary, an eccentric double-neutron- 
star system (high mass progenitor), or end up as a white dwarf in a very circular 
orbit about a pulsar with rotational periods of, typically, a few milliseconds (low 
mass progenitor). 

About 5% of all pulsars known to date have been identified as binary pulsars, 
i.e. pulsars which are members of binary systems. There are two binary pulsars 
which are in orbit with a main-sequence B-star [21,24] and there is a third pul- 
sar suspected to orbit a K-supergiant star [32]. The rest of the binary-pulsar 
population is believed to have a compact star as a companion (white dwarf or 
neutron star) . Since in most cases the orbital separation is much larger than the 
size of the compact companion, neither mass-transfer between the two compo- 
nents of the binary system is present, nor tidal effects are having any significant 
influence on their orbital dynamics (‘clean’ binary system). Therefore, the orbital 
dynamics of most binary-pulsar systems should follow a simple ‘point-particle’ 
model. Presently six binary pulsars are thought to have a neutron-star as their 
companion. The range of their orbital periods is from four hours to more than 
two weeks and in all six cases the orbital eccentricity is high (see Table 1 for 
details) . 



Table 1. Binary pulsars with neutron-star companions. 



PSR 


P (ms) 


Pb (days) 


e 


discovery 


comments 


Ref. 


J1141-6545 


394 


0.20 


0.17 


1999 


maybe white-dwarf companion 


[32] 


J1518-t4904 


40.9 


8.63 


0.25 


1994 




[36] 


B1534-tl2 


37.9 


0.42 


0.27 


1990 




[59] 


J1811-1736 


104 


18.8 


0.83 


1997 




[30] 


B1913-tl6 


59.0 


0.32 


0.62 


1974 


Hulse-Taylor pulsar 
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Fig. 1. Plot of period derivative, P, versus pulse period, P, for pulsars taken from 
[47,48]. Pulsars that are members of a binary systems are marked with a circle and 
pulsars associated with a supernova remnant are marked with a star. For an updated 
version of this P-P-digram see [32]. 



While the binary pulsar PSR B2303+49 was suspected to be a member of 
a double-neutron-star system, more recent optical observations using the 10- 
m Keck telescope indicate a massive white dwarf as the companion to PSR 
B2303+49 [52]. Also PSR J1 141— 6545 could, instead of a neutron star, have a 
massive white dwarf as a companion as pointed out by Tauris and Sennels [44] . 
In fact, most binary pulsars are in orbit with a white dwarf companion. While 
the orbital periods range from 1.6 hours to 3.3 years the orbital eccentricities for 
these systems are generally very small, in some cases even less than 10“®. All 
of these systems suffered a phase of mass transfer in the past which was very 
effective in circularizing the binary orbit. 

For a general review on pulsars see [31,29]. 

2 Pulsar Timing 

Pulsar timing has become an important subfield of observational astronomy and 
proved to provide a powerful tool for many areas of physical and astrophysical 
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research. In particular, the high-precision timing of millisecond pulsar (P < 20 
ms) has a wide range of applications: 

• Time-keeping metrology 

• Planetary ephemerides 

• High-precision astrometry (position and proper motion of pulsars) 

• Probing the cosmological gravitational-wave background 

• Tests of relativistic theories of gravity 

• Properties of the interstellar medium 

• Evolution of binary-star systems 

See [23,5] for a review on pulsar timing applications. 

A complete discussion of the observing procedures and instrumentation used 
in pulsar-timing experiments is beyond the scope of this article. We therefore 
only summarize the basic ideas. For a more detailed description the reader is 
referred to review articles, like [3]. Due to the distribution of free electrons in 
the interstellar medium, pulses at higher radio frequencies propagate at a higher 
group velocity and, therefore, arrive earlier at the telescope than pulses at lower 
frequencies. To correct for this propagation effect across the bandwidth of the 
receiver one uses either filterbanks to divide the observing bandpass into small 
channels or a ‘coherent dedispersion’ system, which samples the raw telescope 
voltages and then removes the effects of dispersion using a software filter. In 
practice the second method yields a clearly better timing precision. 

Most of the pulsars are weak radio sources and therefore an addition of many 
thousands of pulses is required in order to produce a pulse profile with a good 
signal-to-noise ratio. Apart from this, a high number of individual pulses has to 
be added, even for strong pulsars, to produce a stable integrated profile, since 
individual pulses vary quite dramatically (see Fig. 2). Therefore the pulse signal 
is folded over a time span of typically several minutes at the predicted topocen- 
tric pulse period in order to produce a total-intensity profile. Cross-correlation 
with a standard profile with high signal-to-noise ratio, obtained during previ- 
ous observations, determines the observed rotational phase of the pulsar (f> as a 
function of the telescope time r or, equivalently, the time of arrival (TOA) of 
a chosen point of reference in the pulse profile. For a pulse profile with good 
signal-to-noise ratio the location of the point of reference with respect to the 
time stamp can be determined with high precision, typically 10“^ x P. 

The TOA, which is measured using the time r of the atomic clock stationed 
at the radio-telescope site, has to be connected with the time T of the pulse 
emission as measured in the co-moving frame of the pulsar. Thereby, one has to 
correct for a number of propagation and time-dilation effects within the solar 
system and, in case of binary pulsars, for propagation and time-dilation effects 
in the binary system. This transformation can be written as 

T = T "To “t“ ^ / f T 2\j^oemer,0 “t” 2ig},apiro,0 “t” 2\Einstein,0 

T T ^Shapiro , 6 T ^Einstein, b 5 ( 1 ) 

where tq denotes a reference epoch and Ac an offset between the observatory- 
master clock and the reference standard of terrestrial time which is determined 
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Fig. 2. A sequence of 100 individual pulses of the strong pulsar PSR B1133+16. The 
sum of all the individual pulses forms the characteristic integrated pulse profile as 
shown in the box of the top. 



using the Global Positioning System (GPS). The dispersive delay equals to H//^, 
where D is proportional to the column density of free electrons between pulsar 
and observer and / is the observing radio frequency. The parameter D and 
possible long-term variations of D can be determined by using observations 
at different radio-frequencies. In Effelsberg, for instance, we use 1.4 GHz as 
our prime timing frequency and in addition we make use of 0.8 and 2.7 GHz 
observations [27]. LlRoemer,©, ^Shapiro,©, and ZiEinstein,© account for propagation 
delays and time dilation effects within the solar system. If the pulsar is a member 
of a binary system similar terms accounting for these effects within the binary 
system are needed (index b in equation (1)). Full details on all of the terms in 
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Fig. 3. Definition of the five Keplerian parameters for a pulsar in a binary orbit: The 
location of periastron (point of closest approach to the center-of-mass of the binary 
system) is given by the longitnde of periastron, ui. The orbital period, P/,, is the time 
elapsed between two consecutive periastron passages and To is the date of a chosen 
periastron passage. The periastron distance is given by the eccentricity of the binary 
orbit, e, and the semi-major axis of the pulsar orbit, Op. From timing observations, 
however, only the combination Up sin i (projected semi-major axis) can be extracted, 
where i is the orbital inclination. 



equation (1) can be found in [11,49,17]. Each of these effects is described by a 
set of parameters which form three major groups: 

Spin parameters: Rotational frequency v of the pulsar and its time derivatives. 
Astrometric parameters: Position of the pulsar on the sky, proper motion, and 
parallax. 

Binary parameters: 5 Keplerian parameters (see Fig. 3) and a set of post-Kep- 
lerian (PK) parameters, which account for relativistic corrections to a New- 
tonian timing model. 

In particular for millisecond pulsars, where in many cases TOA measurements 
with sub-microsecond precision are achievable, timing parameters can be de- 
termined with excellent accuracy. For instance, the positional coordinates for 
some millisecond pulsars are known within a few micro-arc-seconds and orbital 
periods are typically determined with a fractional precision of 10~^*^. In addi- 
tion, while ‘normal’ pulsars show large random irregularities in their periods on 
time-scales of months and years, known as timing noise (see [31] and references 
therein), recycled pulsars are found to have a high rotational stability. Only a few 
recycled pulsars show deviations from a simple spin-down model on time-scales 
of a few years (‘red noise’) like PSR B1937-I-21 — the pulsar with the shortest 
rotational period (1.56 ms) — as shown in Fig. 4. The combination of rotational 
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stability and high timing-accuracy provide perfect conditions for high precision 
‘clock comparison’ experiments which have the power to test strong-field gravity. 
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Fig. 4. Post-fit residuals for the millisecond pulsar PSR B1937+21 after fitting for 
astrometric parameters, and the period and period derivative [25]. The unmodelled 
long-term variations in the residuals are understood to be caused by timing noise 
which strongly compromises the use of this pulsar as a long-term time standard. 



The important point for testing relativistic effects in binary pulsar systems is 
that the Keplerian and PK parameters can be measured in a phenomenological 
manner, independently of the choice of a specific theory of gravity. One expects 
that in a particular theory of gravity, the PK parameters can be written as func- 
tions of the pulsar and companion masses, mp and nic, and the well-determined 
Keplerian parameters. In general relativity, for instance, the five most important 
PK parameters are given in a first approximation by [11]: 



/ p \ —5/3 . 

uj =3T|/^ j (wp -b , 

_ ^ 2/3 rncjrnp + 2mc) 
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where to denotes the relativistic advance of periastron, 7 is the amplitude of the 
combined effect of relativistic time dilation and gravitational redshift, Ph denotes 
the time derivative of the orbital period due to gravitational wave damping, and 
r and s are two parameters related to the Shapiro propagation delay caused by 
the gravitational field of the companion. The masses mp and rric are expressed 
in units of solar masses (Mq), and we use the notations s = sint and Tq = 
GMq/c^ = 4.925490947 /xs. G denotes the Newtonian constant of gravity and c 
the speed of light. The projected semi-major axis x is defined as a; = OpSinz/c. 

The measurement of the five Keplerian parameters and two post Keplerian 
parameters will allow to determine the masses of pulsar and companion within 
a given theory of gravity. Once the masses are known with sufficient precision 
all the other PK parameters can be determined for this system. Hence, the 
measurement of more than two PK parameters allows to test the theory of gravity 
being used. More precisely, the measurement of n PK parameters determines n 
curves in the two dimensional mp-mc plane whose shape and position depend 
on the theory of gravity being applied. If the theory of gravity and the simple 
theoretical model give an accurate description for the binary system, the n curves 
meet at one point [17]. 

3 Binary Pulsars and Gravity Experiment I. 

Double— Neutron— Star Binaries 

Out of the 6 known double-neutron-star systems presently only two of them 
allow the determination of more than two PK parameters, which is necessary to 
conduct tests as outlined in the previous section. In the following we summarize 
the results for these two binary-pulsar systems. 

3.1 PSR B1913+16 

PSR B1913+16 was the first binary pulsar to be discovered. It was 25 years ago 
when Joseph Taylor and Russell Hulse found this double-neutron-star system 
during a systematic search for new pulsars using the 305-m Arecibo radiotele- 
scope [19]. At present three PK are measured for this system with high precision: 
the relativistic advance of periastron oj, the Einstein delay 7, and a change in 
the orbital period Pi,. Figure 5 shows the corresponding three curves in the rrzp- 
rric plane based on equations (2) to (4) and the observed values for these three 
parameters. 

Before the observed Ph can be compared to the theoretical value, we must 
apply a correction which accounts for the acceleration of the binary-pulsar sys- 
tem with respect to the solar system caused by the gravitational potential of the 
Galaxy and an apparent acceleration due to the proper motion of the binary sys- 
tem (Shklovskii effect). For PSR B1913-I-16 these corrections can be estimated 
with necessary precision and as a result, Cj, 7, and pcorrected excellent 

agreement (0.25%) with the predictions of general relativity [49,16,45,46]. In 
particular, the observed change in the orbital period confirms that the binary 
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Fig. 5. Mass-mass diagram for the PSR B1913+16 system. Labelled curves illustrate 
68% confidence ranges for indicated PK parameters. 



system is losing orbital energy at the rate which is predicted by the quadrupole 
formula of general relativity. Apart from this test, PSR B1913+16 is providing 
the most precise neutron-star masses ever measured: nip = 1.4411 ± 0.0007 and 
rup = 1.3879 ±0.0007. ^ 

Unfortunately, there is little hope that the precision of this 6j-^-Pb test can 
improve much further, since this would require a high precision in the determi- 
nation of the pulsar distance and a good knowledge of the gravitational field of 
our Galaxy in order to correct for acceleration effects present in the observed 
Pb- Presently the only way to determine the distance of PSR B1913±16 is based 
on models of the free electron distribution in our Galaxy, which rarely allow a 
precision better than 20%. 



3.2 PSR B1534±12 

With the discovery of PSR B1534±12 in 1990 by Wolszczan [59] a second double- 
neutron-star binary became available for testing relativity in strong field regimes. 

PSR B1534±12 is significantly brighter than PSR B1913±16, and its pulse has 

^ Note, these are the observed masses of pulsar and companion, which are related to the 
intrinsic (true) masses by mintrinsic = Dniohs- The parameter U is a Doppler factor 
characterizing the radial velocity, V7, of the binary-pulsar system. D ~ 1 — V7/c 
cannot be determined from timing observations and is set equal to one during the 
fitting process. 
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a narrow peak, leading to more precise timing measurements. In addition, the 
orbit is seen nearly edge-on and, as a consequence, the Shapiro propagation delay 
caused by the gravitational field of the companion star can easily be extracted 
from the timing data. Thus, in addition to w, 7, and Pb two PK parameters 
characterizing the Shapiro propagation delay are known for this double-neutron- 
star system, i.e. the “range” r and the “shape” s of the Shapiro delay [41]. The 
resulting five curves in the mass-mass digram are shown in Fig. 6. 




Fig. 6. Mass-mass diagram for the PSR B1534+12 system. Labelled curves illustrate 
68% confidence ranges for indicated PK parameters. 



The four curves for oj, 7, r, and s have a common solution in the rrip-mc 
parameter space {nip = 1.334 ± 0.002, nip = 1.344 ± 0.002) leading to non- 
radiative test of relativistic gravity, complementing the (radiative) w-y-Ph test 
forPSRB1913+16[50j. 

The fifth curve corresponding to the observed Pt does not agree with the 
solution for the Cj-^-r-s test. As in the case of PSR B 19 13+ 16 the observed Pb 
is the sum of an apparent and an intrinsic change of the orbital period. However, 
for PSR B 1534+ 12 the non-intrinsic part of Pb ammounts to more than 30% 
of the observed Pb- The Pb curve does agree with the other four curves if one 
assumes a pulsar distance of 1.08+0.15 kpc [42]. In order to perform a uj-j-Pb 
test for this system a precise distance estimation is needed. A reliable value for 
the distance could either come from the measurement of a timing parallax or 
the measurement of a parallax distance using VLBI observations. 
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4 Binary Pulsars and Gravity Experiment II. 

Small— Eccentricity Binary Pulsars 

The majority of binary pulsars is found to be in orbit with a white-dwarf com- 
panion. Due to the mass transfer in the past, these systems have very small 
orbital eccentricities. Therefore, the location of periastron is not at all a promi- 
nent feature in pulsar timing observations, and neither to nor 7 were measured 
for any of these binary pulsars. In fact, the only PK parameters measured with 
reasonable accuracy for a small-eccentricity binary pulsar are the two Shapiro 
parameters, r and s, in case of PSR B1855-I-09 [25]. However, since the orbital 
period of this system is 12.3 days, the expected Pb is by far too small to be of 
any importance for timing observations and, consequently, there is no third PK 
parameter which would allow the kind of test conducted with the two double- 
neutron-star systems above. 

On the other hand, many alternative theories of gravity, tensor-scalar theo- 
ries for instance, predict effects that depend strongly on the difference between 
the gravitational self-energy per unit mass (e = jm(?') of the two masses 

of a binary system. While this difference in binding energies is comparably small 
for double-neutron-star systems, it is large in neutron star-white dwarf systems 
since for a white dwarf e ~ lO”"* while for a 1 . 4 M 0 neutron star e « 0.15. 

Damour and Eposito-Farese developed a field-theory based framework for 
discussing and interpreting experimental tests of relativistic gravity at a second 
post-Newtonian (2PN) level [12]. This framework is based on a class of tensor- 
multiscalar theories in which gravity is mediated by a tensor field together with 
one or several scalar fields. Within this tensor-multiscalar framework, 2PN devi- 
ations from general relativity can be fully described by two parameters, e and C. 
Damour and Eposito-Farese come to the conclusion that it is extremely difficult 
to measure these parameters in the solar system. In a second paper [13] they 
go even further and demonstrate that certain classes of tensor-scalar theories 
develop nonperturbative strong field effects which lead to significant deviations 
from general relativity in conditions involving strong gravitational fields. Since 
such strong field effects only occur in neutron stars above a certain critical mass, 
they are completely inaccessible by solar-system experiments and, at present, 
could only be detected in binary pulsar experiments. It was pointed out by 
Damour and Eposito-Farese in [14], that future LIGO- VIRGO observations of 
inspiralling compact binaries are not competitive with present binary-pulsar 
tests in their discriminating probing power of strong-held gravity effects. Thus, 
binary pulsars are and will continue to be superb tools for testing strong-held 
aspects of gravity. 

While double-neutron-star systems, like PSR 1534-1-12 and PSR B1913-I-16, 
already tightly constrain tensor-multiscalar theories of gravity, there are theo- 
retical predictions for which neutron star-white dwarf systems provide better 
testing grounds, like the existence of gravitational dipole radiation and the vio- 
lation of the strong equivalence principle. 
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4.1 Gravitational Dipole Radiation 



Unlike general relativity, many alternative theories of gravity predict the pres- 
ence of all radiative multipoles — monopole and dipole, as well as quadrupole and 
higher multipoles [56]. For binary systems scalar-tensor theories, for instance, 
predict a loss of orbital energy which at highest order is dominated by scalar 
dipole radiation. As a result, the orbital period, Pt,, should change according to 



p(dipole) 



1 -I- e^/2 mpiTic , 

c^Pb (1 — mp + nic 




( 7 ) 



where —apUip and —acrric are the total scalar charge of pulsar and companion, 
respectively. G* is the ‘bare’ gravitational constant and the masses Trip and 
rric are measured in units of solar masses Mq. For a white dwarf companion 
jocl 1 and thus the expression (op — in equation (7) can be of the order 
one if the pulsar develops a significant amount of scalar charge. In this case 
the gravitational wave damping of the orbit is completely dominated by the 
emission of gravitational dipole radiation. Note, in double-neutron-star systems 
where the two components have similar mass the term («p — «c)^ is very small 
and, therefore, gravitational dipole radiation would be of much less importance 
than in neutron star-white dwarf systems. 

PSR J1012-I-5307 is a 5.3 ms pulsar in a 14.5 h circular orbit with a low mass 
white-dwarf companion. Since its discovery in 1993 [35] this pulsar has been 
timed on a regular basis using the Jodrell Bank 76-m and the Effelsberg 100-m 
radiotelescope, sometimes achieving a timing accuracy of 500 ns after just 10 
min of integration [27,28]. In addition, the white-dwarf companion appears to 
be relatively bright (V = 19.6) and shows strong Balmer absorption lines. Based 
on white dwarf model calculations, a companion mass of nic = 0.16 ± 0.02 and 
a distance of 840 ± 90 pc was derived [51,8]. Further, a reliable radial velocity 
curve for the white dwarf has been extracted, which then, in combination with 
the pulsar timing information, gave a mass for the pulsar of nip = 1.64 ± 0.22. 
Since e ~ 0 for this binary system, we find from equation (7) 

p^(dipole) _ _5 X j^Q-lO ^2 _ 



On the other hand, the change of the orbital period as predicted by general 
relativity amounts to —10“^^, which is more than four orders of magnitude less 
if lofpl is of the order one. A comparison with the limits on Pb, as extracted from 
pulsar-timing observations, i.e. Pb = (0.1 ± 1.8) x 10“^^, sets an upper limit of 



\ap\ < 0.02 . (9) 

Using the 2PN parameter in equation (8) can be replaced by CpC where 

Cp is the compactness of the neutron star. For a medium equation of state Cp « 
0.21 nip/MQ holds and we find 

-3 X 10-3 < C < 3 X 10-3 , (10) 



which is the so far tightest limit on the parameter C [28] . 
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Provided regular timing observations are conducted on a monthly basis, limits 
on the change of the orbital period for this system should improve by a factor 
of five for this system within the next three years. 

4.2 Violation of the Strong Equivalence Principle 

The strong equivalence principle (SEP) requires the universality of free fall of all 
objects in an external gravitational field regardless of their mass, composition 
and fraction of gravitational self-energy. While all metric theories of gravity 
share the property of universality of free fall of test particles (weak equivalence 
principle), many of them, which are considered as realistic alternatives to general 
relativity, predict a violation of the strong equivalence principle. It was pointed 
out by Nordtvedt [37] that laboratory-size bodies possess a negligible fraction of 
gravitational self-energy and therefore laboratory-free-fall experiments indicate 
nothing about a violation of the SEP. Nordtvedt [38] further pointed out that 
lunar-laser-ranging experiments have the potential to test a violation of the SEP 
due to the difference in fraction of gravitational self-energy between the Earth 
(e ~ —4.6 X 10“^°) and the Moon (e ~ —0.2 x 10“^°) which are both exposed 
to the (external) gravitational field of the Sun. A violation of the SEP can be 
understood as an inequality between the gravitational mass, m,g, and the inertial 
mass, nii, which can be written as function of e: 

^ = 1 + S{e) = l + ge + 0 (e^) . (11) 

rrit 

While the analysis of lunar-laser-ranging data tightly constrains the ‘Nordtvedt 
parameter’ rj [58,33] it indicates nothing about a violation of the SEP in strong- 
held regimes, i.e. terms of higher order in e, due to the smallness of e for solar- 
system bodies. For neutron stars, however, e ~ 0.15 and thus binary-pulsars 
with white-dwarf companions (e ^ 10“^) provide ideal laboratories for testing a 
violation of the SEP due to nonlinear properties of the gravitational interaction 
[15]. 

In case of a violation of the SEP the eccentricity vector of a small-eccentricity 
binary-pulsar system with a white-dwarf companion exposed to the external 
gravitational held of the Galaxy evolves due to Fig. 7. For the length of the 
‘induced’ eccentricity vector ep one hnds 

ep (X (Sp - Sc) Ph a_L ■ (12) 

Therefore, small eccentricity binaries with long orbital periods are most sensitive 
to a violation of the SEP. Since neither 6 nor ep are known quantities in our 
analysis we have to treat them in a statistical manner in order to get limits for 
ep. The angle 9, for instance, is assumed to be uniformly distributed between 0 
and 27 t. On the other hand, given a certain {Sp — Sc) — Sp, i.e. a certain cf for a 
given binary pulsar, the observed eccentricity, e, sets an upper limit on 9 which 
is independent of ep. If we now compare the limit on 9 which we get for every 
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Fig. 7. Evolution of the eccentricity vector e (vector of length e pointing to periastron) 
for a small eccentricity binary system in case of a violation of the SEP [15]. e is a 
superposition of the constant vector es and the vector e_R which is turning in the 
orbital plane with the rate of the relativistic advance of periastron, i.e. 6 — 9o + u>t. 
The vector aj^ denotes the projection of the Galactic acceleration onto the orbital 
plane. 



observed small-eccentricity binary-pulsar system with a uniform distribution 
using Monte-Carlo simulations we obtain an safe upper limit for Sp which is 

|,5p| < 0.009 (95% C.L.) (13) 



(For details see [55]). 

In the framework of tensor-multiscalar theories 6p can be expressed using 
the 2PN parameters e and C. For a medium equation of state one finds 

i,»0.045 (d)'(| + c). (14) 

Consequently, assuming a typical neutron-star mass of 1.4Mq one obtains 



e 

2 



+ c 



< 0.1 



(95% C.L.) . 



(15) 



Since C is already tightly constrained by equation (10) we find a safe upper limit 
for [e] of 0.2. Further restrictions on e can be derived from the PSR B1534+12 
and PSR B1913+16 tests (see [12] for details). 



4.3 Violation of Local Lorentz Invariance and Conservation Laws 

If gravity is mediated in part by a long-range vector field or by a second tensor 
field one expects the global matter distribution in the Universe to select a pre- 
ferred frame for the gravitational interaction [57] . At the post-Newtonian level, 
gravitational effects associated with such a violation of the local Lorentz invari- 
ance of gravity are characterized by two theory dependent parameters a\ and 02 - 
The close alignment of the Sun’s spin axis with the total angular momentum of 
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the solar system yields a tight bound on the second parameter, a-i < 2.4 x 10“^ 
[39]. If a\ were different from zero, a binary system which moves with respect 
to the global matter distribution in the Universe would suffer a secular change 
of the orbital eccentricity e. In fact, for small eccentricity binary systems the 
evolution of the eccentricity vector e is the same as illustrated in Fig. 7 for a 
violation of the SEP. This time, however, aj_ represents the direction of motion 
with respect to the preferred frame projected onto the orbital plane and the 
‘induced’ eccentricity vector ep has the property 

[6^1 cx oi \mp - TOcj w± , (16) 

where wj_ is the projected velocity of the binary system with respect to the 
global matter distribution, i.e. the cosmic microwave background. Again we can 
perform a Mote-Carlo analysis as outlined in the previous section and derive an 
upper limit for the parameter ai from this [55]: 

[ail < 1.2 X 10"^ (95% C.L.) (17) 

This limit is slightly better than the limit obtained from lunar-laser-ranging 
data [34] and, more importantly, also holds for strong gravitational-field effects 
which could occur in the strong-field regions of neutron stars. Due to its small 
eccentricity, e < 1.7 x 10“® (95% C.L.), and high velocity with respect to the 
cosmic microwave background {w « 500 km/s), PSR J1012+5307 turns out to 
be the most important binary system for this kind of analysis [28]. 

In theories of gravity which violate the local Lorentz invariance and the 
momentum conservation law, a rotating self-gravitating body will suffer a self- 
acceleration which is given by [40] 



aseif = - Y e w X (18) 

where «3 is a theory dependent parameter and e is the fraction of gravitational 
self-energy of the body moving with velocity w with respect to the preferred 
frame, fi denotes the rotational velocity of the body. Again, binary pulsars 
are ideal probes for this kind of self-acceleration effects [6]. A careful analy- 
sis analogous to the previous analyses (SEP, local Lorentz invariance), which 
appropriately takes care of selection effects, gives 

Jag] < 1.5 X IQ-i® (95% C.L.) (19) 

as a safe upper limit for a combined violation of local Lorentz invariance and 
momentum conservation [55]. 

5 Geodetic Precession 

In general relativity the proper reference frame of a freely falling object suffers 
a precession with respect to a distant observer (geodetic precession). This was 
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tested with high precision in the static gravitational field of the sun using the 
Earth-Moon reference system [58,34]. In a binary-pulsar system this geodetic 
precession leads to a relativistic spin-orbit coupling, analogous to spin-orbit 
coupling in atomic physics. As a consequence, the spin of the pulsar precesses 
about the orbital angular momentum with an angular frequency [4,7] 

Wc(4TOp + 3me) 1 
® 2{mp + 1 — 

Since the angular momentum of the orbital motion is much larger than the 
spin of the pulsar it practically represents a fixed direction in space, defined by 
the orbital plane of the binary system. For PSR B1913+16, for instance, 
amounts to 1.21° per year, i.e. it takes 300 years for a full 360° precession. If 
the spin of the pulsar is sufficiently inclined with respect to the orbital angular 
momentum, the geodetic precession of the pulsar will notably change the spin- 
direction of the pulsar. As the angle between the pulsar-spin axis and the line- 
of-sight to the pulsar changes, our view of the pulsar-emission region changes 
and, consequently, a secular evolution of the observed pulse profile is expected. 

For PSR B1913+16 long-term changes in the 1.4 GHz pulse profile were first 
reported ten years ago by Weisberg, Romani & Taylor [53]; (see Fig. 8 for a plot of 
the pulse profile). They observed a change in the relative amplitude of the leading 
and trailing sub-pulse of the PSR 1913+16 pulse-profile which was understood 
as the first detection of the effects of geodetic precession in a binary system. 
Based on a simple model for the intensity of pulsar-radio emission, Istomin [20] 
derived a value of 25° for the angle between the spin axis of the pulsar and 
the orbital angular momentum and made the prediction that the pulsar will 
become unobservable near the year 2020, when geodetic precession moves the 
line-of-sight outside the emission region of PSR B1913+16. Cordes, Wasserman, 
and Blaskiewicz [10] used polarization data to constrain the orientation of the 
pulsar and argued that a change in the separation between the two sub-pulses, 
as expected from a simple hollow-cone model for the pulsar-radio emission, has 
not been observed so far due to an unfavourable phase in the precession. In 
1998, eventually, Kramer [26] reported the detection of changes in the sub-pulse 
separation for PSR B1913+16 based on five years of observations at the 100-m 
Effelsberg radiotelescope and showed that these changes are in agreement with 
a hollow-cone model. New Arecibo observations [46] and Effelsberg polarization 
data [22] lead to improved restrictions on the orientation and emission-geometry 
parameters of PSR B 19 13+ 16 (based on pure geometrical arguments). It is worth 
noting, that these restrictions still allow for a geometry where the line of sight 
will never move outside the emission cone and the pulsar therefore is always 
visible. 

Only recently Stairs et al. [43] reported a secular evolution of the PSR 
B1534+12 (12P"'°° = 0.51°/yr) pulse profile. In both cases, PSR B1913+16 and 
PSR B1534+12, the changes in the observed pulse profiles are generally accepted 
as a qualitative test for the presence of geodetic precession in these systems. To 
date, however, in non of these two cases one can derive useful limits to the rate of 
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Fig. 8. Pulse profile of PSR B1913+16 at 1.41 GHz. 



precession, Due to the uncertainties in pulsar emission it remains an open 

question whether one day these experiments will allow for a quantitative test of 
geodetic precession. PSR B1534+12 seems to be the more promising candidate 
since its magnetic geometry is well constrained from polarization observations 
[2]. For pulsar astronomy geodetic precession provides a unique opportunity to 
obtain two-dimensional information on the structure of a pulsar emission zone 
[53,54]. 

Two more pulsars from Table I have a significant rate of geodetic precession, 

both PSR B2127+11C and PSR J1141 6545, exceeding PSR B1913+16 in its 

precession rate with 12?"'®® = 1.3°/yr and I??"®® = 1.7°/yr, respectively. 'While 
PSR B2127+11C is a weak pulsar where it is difficult to get a pulse profile with 
sufficient signal-to-noise ratio, the newly discovered pulsar PSR J1 141-6545 is 
likely to show interesting pulse-profile changes within the next few years. 
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Abstract. We present a second-quantized field theory of massive spin one-half parti- 
cles or antiparticles in the presence of a weak gravitational field treated as a spin two 
external field in a fiat Minkowski background. We solve the difficulties which arise from 
the derivative coupling and we are able to introduce an interaction picture. We derive 
expressions for the scattering amplitude and for the outgoing spinor to first-order. In 
several appendices, the link with the canonical approach in General Relativity is es- 
tablished and a generalized stationary phase method is used to calculate the outgoing 
spinor. We show how our expressions can be used to calculate and discuss phase shifts 
in the context of matter-wave interferometry (especially atom or antiatom interfer- 
ometry). In this way, many effects are introduced in a unified relativistic framework, 
including spin-gravitation terms: gravitational red shift, Thomas precession, Sagnac 
effect, spin-rotation effect, orbital and spin Lense-Thirring effects, de Sitter geode- 
tic precession and finally the effect of gravitational waves. A new analogy with the 
electromagnetic interaction is pointed out. 



1 Introduction 

The development of high accuracy atom interferometers, used as clocks in the 
microwave or in the optical domain, as inertial sensors (gyros, gravimeters, gra- 
diometers...) or for the determination of atomic masses and of the fine structure 
constant [1-11], requires now a framework to describe the interference of atom 
waves in a rigorous way. On one hand, one needs to investigate general rela- 
tivistic effects including those involving the spin of the atoms and, on the other 
hand, it is necessary to take into account the statistical properties (bosonic or 
fermionic) of the interfering particles, given the development of coherent atom 
wave sources and also for a proper treatment of the detection noise. One would 
also like to be able to discuss the propagation of antimatter in interferometers 
in the presence of gravitation and as suggested in reference [22] the properties of 
coherent antimatter waves (generated by an antiatom laser such as an antihydro- 
gen Bose-Einstein condensate) . This is possible only within relativistic quantum 
field theory. Atoms (or antiatoms), in a given eigenstate of the internal atomic 
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Hamiltonian, are considered as elementary particles having a rest mass fixed by 
the energy of the atomic level and a spin equal to the total angular momentum 
of the atom in that level. In this paper, we shall consider only Dirac particles for 
illustration. This is the simplest example of particles with spin which still con- 
tains most interesting effects related to spin and applies to neutron or electron 
interferometry as a special case. The generalization to others spin values is pos- 
sible along similar lines with Dirac-type equations (Bargmann-Wigner [27], de 
Broglie fusion method [28], Durand [29]). Our overall goal is to introduce grav- 
itation and general relativistic effects at the quantum level of modern atomic 
physics and quantum optics experiments. For this, we propose an extension of 
our first paper on atom interferometry in General Relativity [12], which includes 
now a second-quantization scheme for the atom waves in the presence of grav- 
itational and electromagnetic fields. The point of view adopted in this paper is 
the extrinsic point of view using purely quantum field theory in a flat Minkowski 
background. The connection with the canonical intrinsic approach, using Dirac 
equation in curved space-time, is made in Appendices A and B. The reader who 
wishes to start with this canonical approach is thus invited to read first these 
appendices. 

In the main text, we begin right away with the minimal coupling Lagrangian 
in flat space-time, which is identical to the one derived in curved space-time 
for a standard choice of tetrads. Then, we proceed with the quantization of 
the Dirac field and we emphasize the difficulties which arise because of the 
derivative coupling. These difficulties are solved in a consistent scheme which 
allows also to define an interaction picture. The evolution operator and the S- 
matrix are constructed and we demonstrate explicitly a conjecture of Gupta. 
These results are used to derive formulas for the transition amplitude and for 
the outgoing spinor in the weak-field approximation, first in configuration space 
and second in the momentum representation. An expansion in the perturbation 
wave vector (hk/mc) is used to retrieve various physical effects, some of which 
are well-known. A new analogy with the electromagnetic interaction is presented 
which includes all components of the field and generalizes gravitoelectric 
and gravitomagnetic interactions. In Appendices G and D, calculations of the 
outgoing spinor are sketched, first with a generalized stationary phase method 
in configuration space and second in the momentum representation. 

2 Lagrangian Theory 

Gonsidered as a field theory in flat spacetime, the theory describing the interac- 
tion of matter with a given gravitational field will be defined by a Lagrangian 
density of the following form [15,19]: 

£ = £o - , (1) 

where is the given external field, and where £q is the free Lagrangian density 
of the matter field and the corresponding stress-energy tensor. For a Dirac 
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field, one has respectively^, in symmetrical form, 
he 



jC-ci — — 



h 






rr - r , 



^ “ f^7At) '1' + ^ - i%lv^ 



(2) 

(3) 



The total Lagrangian density (1) then becomes 

£ = (1 + \h) £o - lihch^^'' W - "9 ,7^) <f, (4) 

where^ h = . This Lagrangian density can also be considered 

as obtained from the Lagrangian density valid in General Relativity for the 
interaction of the Dirac field with a prescribed gravitational field in the linear 
approximation (see the Appendices). 

The equations for !?' and derived from (4) are the following^: 



[(1 + \h) - mc/h) - + \d^hr\l' = 0, (5) 

- mc/h) (1 + \h) + - ^d^h'y^] = 0. (6) 

They admit the conserved current 

f (7) 

which coincides with the usual Dirac current when the gravitational field van- 
ishes. 

In order to stress some other differences and analogies with electromagnetism, 
we may write the equation for 'P with a covariant derivative in the usual sense 
of non~Abelian gauge field theories in flat space-time 

where 

^ ( 9 ) 

^ The conventions used here for the metric and for the Dirac equation and matrices 
are generally those of [17]. In particular, the signature of the metric is taken as 
Greek indices run from 0 to 3 and latin indices run from 1 to 

3. The space-time 4-vector is written x = {x^, x) = (ct, x). The partial derivatives 
with the right arrow act on the right and those with the left arrow on the left. 

^ The term (h/2)jCo in (4) comes from the term containing £q in (3), and was omitted 
in [16]. However, £q which vanishes when the free Dirac equation is satisfied, does 
not vanish here. 

® If one considers these equations as first-order equations with respect to the 
the factor (1 + h/2) in the first term can be replaced by 1. But it can be shown 
that the equations so obtained cannot be derived from a Lagrangian, if this latter is 
restricted to depend linearly on P and P, and to admit first-order derivatives only. 
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and where the factor (l + has been removed. The Poincare generators are 
associated with gauge fields and a local gauge invariance^ 



^\x) 



1 + - 9 ^ 6 ) - edx 






(10) 

( 11 ) 



3 Quantization 

We want now to proceed with the quantization of the field W submitted to the 
interaction defined by (4) . When they are applied directly, the standard methods 
lead to some difficulties coming from the presence of a derivative coupling in the 
Lagrangian. These problems are, first, briefly discussed, then, a solution is pre- 
sented allowing the quantization together with the definition of the interaction 
picture. 



3.1 Difficulties with the Derivative Coupling 

Two methods can be used, a priori, according to whether the Lagrangian is 
taken under a symmetrical or a asymmetrical form. 



Symmetrical Lagrangian. Starting from the symmetrical Lagrangian (4), the 
usual anticommutation relations will be obtained from the following expression 
of the conjugate momentum 

dC 

n = 2 — ^ , with A = j (12) 

d'l' 

which gives 

7T = zM[(l+i/i)70-i/i°^7j, (13) 

and 

{'PaA),n/^{y)},^o=yO = ih 6 a /3 S{x - y) (14) 

where a and [3 are spinorial indices. 

By introducing the matrix (depending on the coordinates) 

7 -' = (1 + \h)A - , (15) 

these formulas can be rewritten 

n = iMr"f~^, (16) 

^ Differences with usual Yang-Mills theories come from the non-commutation of the 
Lorentz generators with the Dirac matrices and from the fact that the translation 
generators act on space-time itself. 
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and 

<f'/3(y)}a;0=j/0 = (7(a;))a/3 <5(a^ - y)- (17) 

For the free field the matrix 7(0;) reduces to 7*^. However, it appears that the 
Heisenberg equations, which are generally equivalent to the equations of motion, 
are not satisfied here under their most usual forms. To write the formulas in a 
condensed form, let us introduce the two operators 



^ = he 


(1 + i/i)(-i7"^ + mc/h) + - d^hh 


M > (18) 


0 

II 


+ mc/h){l + i/i) - 


• (19) 


One then finds that the Hamiltonian density III' — C can be written 


as 




n = + w^<i'], 


(20) 


while the field equation reads 






ihej = itiF. 


(21) 


The relation and the formula (7 ^)1 = 7^7 ^7° then give the 


conjugate equation 






—ihedoW ^ ^ . 


(22) 


Using an integration by parts, the total Hamiltonian H can be transformed into 




H = J j (d3x)tf'(ao7”^)'F. 


(23) 


The anticommutation relation (17) then gives the commutator 






[H, If] = -yittF - ^fic(79o7 > 


(24) 


or, given the field equation. 






[H,'I] = —ihedo'! — ^)iF. 


(25) 


Similarly, 


one has 






[H,!'] = -ihcdol' - ^hcI'{doj~^h, 


(26) 


then 


[77,77] = —ihcdoll + ^hcII"/doj~^ . 


(27) 



The second terms in the right members of (25), (26) and (27) are unusual, since 
the fields <F, W and II behave as if, considered as functions of some fundamental 
dynamical variables, they were also explicitly dependent on the time. From the 
expression of II, it is in fact obvious that, among I' and II one of them, at least, 
explicitly depends on time. But this is not obvious for I'. 
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Asymmetrical Lagrangian. The preceding difficulty takes another form if, as 
it is more usual [18], the Lagrangian (4) is replaced by the asymmetrical one 



C' = C+'-^d,f 



= he 'p 






The field equations are left unchanged, and the conjugate field is again: 

n' = ^ = 
dP 



(28) 



(29) 



Since II' = II, the anticommutation relation of P and W is identical to (17), but 
the Hamiltonian is now 

H' = J {cPx)P^'P = J - ihc j {d^x)P{do-/~^)'P. (30) 

It follows that the usual Heisenberg equations are satisfied. In fact, one has 

[H', <P] = - 7 ^if = -ihcdoP , (31) 

[H',W\ = = —ihcdo'I — ihcI'{doj~^)j, (32) 

then 

[H' , n'] = -ihedoH' . (33) 



The equations satisfied by P and II' are those of variables having no explicit time 
dependence, which is the rule for canonical variables. On the contrary, expression 
(29) shows that 1/ depends explicitly on time since 7 “^ does, and equation (32) 
is in agreement with this dependence. 

However, the Hamiltonian H' is not Hermitian. In fact, since one has 
= it follows that 



iL't 






{cPx)'P{do-i~^)I'. 



(34) 



The asymmetry of the equations for 'P and S' is a consequence of this lack of 
Hermiticity. 



Trouble with the Interaction Picture. A common shortcoming of the two 
preceding methods of quantization is the absence of a coherent definition of the 
interaction picture. In fact, if this picture was defined, there would be a unitary 
operator U , such that the corresponding field variables ip and w would be given 

by 

'iP = UPU~^, w = UnU~\ (35) 
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Moreover, these variables would be free fields so that one would have the relation 
vj = Such a relation is not compatible with (35) since one has II = 

7 “^ 70 ) and since U must be unitary. 

In what follows, the quantization is defined in a way such that the preceding 
difficulties do not appear. In particular, the interaction picture will be defined, 
allowing the construction of the transition probabilities and that of the S matrix. 

3.2 A Coherent Method of Quantization 

The afore-mentioned problems will be solved by a change of variables eliminating 
the derivative coupling. The Lagrangian density (4) can be written under the 
form ., .. 

(36) 

Let us introduce the new field 0 by the formula 1/ = AO, where A is a matrix 
to be determined, which depends on the coordinates. One has 

'f'7“^(5o>f') = 6>7°A'^7°7~^A(5 o 6>) + 6>7°A'''7°7“^(9oA)6>. (37) 

The terms containing the time derivatives of 0 and 0 in (36) will be those of the 
free Dirac Lagrangian if one has = 7 *^ or, assuming the inversibility 

of A, if 7 ° 7 “^ = A^~^A~^. By writing the matrix A under the form A = MU 
where M is Hermitian and positive [34] and U unitary, the preceding equation 

Hpr*Ckmp<5 

7 O 7-1 = = I+\h- (38) 

If h^i, is sufficiently small, this matrix is inversible and defines a unique matrix 
M positive-definite [34], while U may be arbitrary. However, to ensure that the 
asymptotic states deduced from 0 or W have the same physical interpretation, 
it is necessary that U goes to the identity when t — >■ ± 00 . More precisely, we will 
impose the condition C/ = 7, by which, when = 0 , the field 0 is a free field 
as '7'. 

The Lagrangian density takes the following form as a function of 0: 

£ = fic ^ ( 07 °( 5 o 0 ) - (5o0)7°0) + \ (or^dkO) - {dk0)r'^0) + \or0 , 

(39) 

where and F are defined by 

Fk = ^ _ 1 (40) 

F = i {-/°A~^doA + F'^A-^dkA) 

- if ((aoAt)At-i 7 ° + (afcAt)At-i( 7 °TS°)) 7 ° - 2^(1 + \h)fAf^A . 

(41) 



Let us note the Hermiticity relations 

(_pfc)t = fr'^f ^ 



= frf . 



(42) 
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The field equations, equivalent to those of and 'F, are now 

i {j°{doO) + r\duO)) + l(r + i{dkr>^)) o = o, (43) 

-z ((9 o 0)7° + (afe0)T'=) + lo{r- z(5feT'=)) = 0, (44) 

while the current reads 

f = cO-f°A^° [(1 + AO , (45) 

or, more explicitly®, 

f = c07°0, j'^ = cOr^O . (46) 

The conjugate momentum IIq of 0 has the same form as in the free-field 
case 

3C. — 

Tie = 2 ^ = ih0f . (47) 

O0 

It follows that the anticommutation relation of 0 and 0 is the usual one 

{0a{x),0i){y)}^o=yo = 7°^ 5{x - y). (48) 

It is equivalent to the anticommutation relation (17) of F and 'F. The Hamilto- 
nian density He now reads 

ne = ~ (0r\dk0) - (dk0)r’^0) - ^0r0, (49) 

which gives for the total Hamiltonian 

He = - y J {d\) [i {0r’^{dk0) - {dk0)r^0) + 0F0] , (50) 

or, equivalently. 

He = -he J{d^x)0 ir'^{dk0) + l(r + ^(afcT'=)) 0 , (51) 

= hcj (d^x) t{dk0)r’^ (52) 

This operator is Hermitian, and from the preceding expressions and, from the 
field equations (43) and (44), one checks the Heisenberg equations 

[Tie, 0] = —ihcdo0 , [He, 0] = —ihcdo0 , [Tie, Tie] = —ihedoHe ■ 

(53) 

® The expression of is identical to that of the free-held case. In Appendix B this 
property is taken as a condition allowing the introduction of the field 0 in the 
framework of the linearized theory of General Relativity. 
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The difficulties discussed in Section 3.1 have disappeared. In particular, the basic 
variables 0, 0 and Uq look like variables having no explicit dependence on time. 
On the contrary, the variables W, and U, initially considered, depend explicitly 
on time, since one has 



rp = A0, = 6 > 7 ° 41 ' 7 ° , n = neA-\ 


(54) 


and since the matrix A generally depends on time. 

Let us remark, however, that the Hamiltonian Hq differs from the Hamilto- 
nian H introduced at the beginning in terms of It is convenient to consider 

H from (23) as the integral of the density 


= ^+^(5o7”^) 1', 


(55) 


and Hq from (51) as the integral of the density 




Ue = hc 0 -ir^dl-]^{r + i{dkr'^)) 0 . 


(56) 


A rather tedious transformation of the former expression, using the expressions 
of 7 “^, r^, ^ , and the definition of 0, leads to the formula 


TL^ = He + ^ 07 ° [4 ^{d^A) — (9o4^)A^ 0. 


(57) 


Conversely, one has 




ne = n^ + ^5^7° [4^-'(ao4-') - (ao4^"')4-i] S' 


(58) 


With the choice made above of the matrix [/, one has A = M = 
defined by (38). The two preceding formulas can then be written 


Afl, with M 


= He + ^ 07 ° [M-\doM] 0, 


(59) 


%0 = %^+ ^^7° [M-i,9oM-i] If. 


(60) 



It is noticeable that the equality = He is valid at first order with respect to 
the In fact, since dgM and doM~^ are first-order quantities, this approx- 

imation is obtained by taking, for the first term in the commutators in (59) and 
(60), the zeroth-order approximation of M~^, that is the matrix unity. 

4 Interaction Picture and the S— Matrix 

With the Lagrangian (39) the interaction picture is easily defined, since the 
corresponding conjugate momentum Uq has the same form as that of the free 
theory. This allows us to define the evolution operator in that picture, then the 
S'-matrix. 
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4.1 Evolution Operator and Transition Amplitudes 

The field in the interaction picture will be denoted by 9. Let us recall that this 
operator is obtained from the Heisenberg operator 0 by a unitary transformation 
such that the field equation becomes the free one [32] . Accordingly, the evolution 
equation of the state vector reads 

= (61) 

where, in the absence of derivative coupling, which is the case for the Lagrangian 
(39), the Hamiltonian Hi(t) is equal to the interaction Hamiltonian expressed 
in terms of 9. From the expression (50) of the total Hamiltonian one gets 

Hi{t) = J {d^x)nint{x), (62) 

with 

Hint = - l'^){dk9) + ^(9fe0)(H'= - 7'=)0 - y0(H + 2^^)9. (63) 

The evolution operator in the interaction picture is, from (61), the solution 
of the following equation together with the initial condition [21] : 

ih^U(t,to) = Hi{t)U{t,to) , U{to,to) = I- (64) 

The perturbation theory is then obtained from the integral equation, equivalent 
to (64), 

U{t,to) = I - ^ [ Hi{T)U{T,to)dT . (65) 

In what follows, we are interested in the transition amplitudes to first order 
with respect to the /i^^’s. These amplitudes will be obtained from the first- 
order approximation with respect to the Hamiltonian Hj of the C/-operator, 
namely 

= -^ [ Hj{T)dT. (66) 

By introducing the Hamiltonian density (63) in normal form®, and the initial 

and final states of the transition, we will consider the amplitudes 

{<Pf\U^^Ht,to)m = -^^j\d^x){<Pf\:nUx)-m, (67) 

® See [21]. To first order this prescription suppresses an infinite contribution, due to 
the energy of the vacuum, in the transition amplitudes for the antiparticles only. 
It can be seen, therefore, as an expression of the symmetry between particles and 
antiparticles. 
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where the pair of colons : : denotes as usual the normal product. 

The first-order expressions needed for the evaluation of (67) are now derived 
from (38) which gives 

yl = M = /- J + (68) 

then, from (40) and (41), 

- 7^= = ^ (69) 

r + 2^ = + ^(5,/r°-)(7S°7, - 7./7'=). (70) 

The corresponding expression for the Hamiltonian density "Hint is 

Hint = ~0 [/i° 07 '= - {dkO) 

+ ^(5fc0) [/i°°7'= - e 

+ ^0 !^/i00-l(afc/r°'^)(7SV-7M7V) 0. (71) 

In Section 5, the expression of the Hamiltonian Hi{t) which will be used is the 
expression obtained from the space integral of (71) by performing the integration 
by parts of the term containing dkO, which gives 

Hi{t) = J {d^x)^^j°VG{x)9{x), (72) 

where the operator Vg{x), acting on 6{x), is given by 

Vg{x) = y7° _ y-yfc) 

+ ^(29fe/r0'=7° + - 9fc/i°°7'=) 

+ ^7° [2/i°S° + - /i°°7l dk . (73) 

This form of the Hamiltonian is closely related to the equation of motion (43) of 
the Heisenberg field 0. In fact, with the help of (69) and (70), one checks that 
this equation can be written: 

— mc^) 6> = 7° Vg0- (74) 

In what follows, the initial and final states, which appear in (67), are some 
one-particle or antiparticle states. They are defined from positive or negative 
energy solutions Xi ^tnd X/ of the free Dirac equation 

= ^(*7''^ + W^) = 0> k = ij. (75) 
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Denoting by \xk) the corresponding states, one has then to calculate the matrix 
0 ]^0 mG nt 

(x/l : 0{x)x^VG{x)e{x) : \x^) ■ (76) 

As a free field, the operator 9 can be written 



0{x)=i2 

r=l ^ 



br{p)x^j^hx) + dl{p)x^pj{x) 



(77) 



where br{p) and dr{p) are the annihilation operators for the particles or antipar- 
ticles, respectively, and x^r the positive or negative energy solutions of the free 
Dirac equation given by [17,21] 



Xp,r (^) 



1 



(27r/i)^/^ Y E{p) 

I 

1 



mc^ 



(27r/i)^/^ Y ^(P) 



(r) (^p-^^iip-x-E(p)t)/h ^ 






(78) 

(79) 



with E{p) = c\Zp^ + m?c^, p = ||p||. In terms of these, any solution x with 
positive or negative energy can be written 

x{x) = X! / (d^P)'X^^r{x){x^^r,x) , (80) 

r 

the scalar product of two solutions being defined by 

(Xi, X 2 ) = J {d^x)^{x)x°X 2 {x) . (81) 



From (80) we have the following expression of the one-particle states 

\x) = J2 f (d^P)ix^i^r,x)bl{p) |0o) , (82) 

r d 

where j<()o) is the vacuum state, the correspondence x — >■ |x) preserving the scalar 
product. Denoting by the positive frequency part of 9, this last formula 
implies 

^'^■^^(a;) \x) = J2 f (d^P)x^i^rix)br{p) lx) 

= X! / ('^^P)Xp|r^a;)(Xp|r^X) l-^o) 

= X{x) |(('o) • (83) 

The matrix element (76) reduces to 



{Xf\9M(x)x°VG{x)9^~^\x) Ixi) = X/(a:)7° Vc(a;)xi(a;), 



(84) 
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so that, from (72), the amplitude (67) reads 

iXf\U^^Ht,to)\Xi) = -^ {d‘^x)xf{x)j°VG{x)xi{x)- (85) 

This formula will be analyzed in more detail in Section 5. 

4.2 S' Matrix 

The S-matrix can be defined when the external field vanishes at the limit of 
infinite time^. It is obtained from the U operator by taking the limits to — >■ — oo 
and t — 1 + 00 , and the corresponding amplitudes can then be given from (85). 
However, it is convenient here to return to the formula (67) in which the Hamil- 
tonian density is given by (71). In fact, we want to show that this expression 
can be transformed into a covariant one, the expression already given by Gupta 
[16]. 

The initial and final states being the same as in the calculation leading to 
(85), formula (67) with the expression (71) of Hint gives, for the first-order 
S-matrix, 

(X/I \Xi) = -iJ (d^x) l^^xj X^ 

+ \xf idkXi) 

- ^ (dkXf) X^Y (86) 

This expression can be simplified by integrating by parts the term containing 
the derivative dkh^^ ■ This leads to the two terms 

[{dkXj){l’'l°Xii - 7m7°7'")x* + m(7'"7°7m - 7M7°7'")('9feXi)] • 

In each of these, by introducing either of the formulas 

7S°7m - - <5^° - 7m7°7"] 

= 2[<iy-<5°7" + 7V7M]. (87) 

one can insert the derivatives in the combinations x^dkXi or dkXjl^ ^ yielding 

- tk)xi + [M(^°-7M7°)(7"5fcX.)-(9fe^7'=)(<5°-7°7M)Xi] ■ 

( 88 ) 

Adding this contribution to the remaining terms in (86), one gets 

(X/I \xi) = -* y {d'^x)\^h!^^XfX^ + ^-^h^^Xflki0k ~ %d)X^ 

\xflk-l°{l''clkXi) - {dkXfl^)l°lf^Xi] I • (89) 

Here, this condition is realized, for example, in the case of a gravitational wave. 
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Finally, using the Dirac equation in the last bracket, this expression reduces to 
(X/I \xi) = “* / {ifM - ^7m) Xi- (90) 

This formula agrees with the rule given by Gupta [16] : the expression appear- 
ing under the integral sign is, up to the sign, obtained from the interaction 
Lagrangian £ — £q in (4) by replacing S' by Xi and S' by xj^- This result is 
identical with the one valid in the case of a nonderivative coupling, where the 
interaction picture exists directly. More generally, the validity of the rule as- 
serted by Gupta, implying the use of the interaction Lagrangian defined by (4) 
and the covariant form of the propagators, can be proved at higher orders in the 
quantized theory defined in Section 3. 



5 Calculation of the Relativistic Phase Shifts 
in the Weak— Field Approximation 

In this final section we use the tools and the material derived in the previous 
sections to make an explicit calculation of the various contributions to a gravita- 
tionally induced phase shift in matter-wave interferometry. We restrict ourselves 
to one-particle or one-antiparticle states. The application of the formalism to 
many-particle states and coherent beams of massive particles of different spins 
will be developed in another publication. We assume that the incoming particles 
or antiparticles are described by the state vector |x(to)) = IXi) at some time to 
before interaction (to can be conveniently taken to be — oo). At a later time t, 
this state evolves into |x(^)) which interferes with a reference beam described by 
IXref) with which it is recombined in the final beam splitter. In practice, jxref) 
is produced by the other arm of the interferometer and in many cases, one will 
have Ixref) = \Xi) ■ 

We are thus interested in the spinorial wave function for one-(anti)particle 
states: 

Xi.x) = {4>o\0{x)\x{t)) (91) 

where 0{x), |x(^))) I0o) are respectively the free-held operator, the one- 

( anti) particle and the vacuum state vectors in the interaction representation. 
It is easily shown that, when pair creations are neglected, this expression is 
equivalent to the Heisenberg held amplitude (0,m| 0{x) |^). 

The interference signal itself is given by the projection: 

J d^xxle{{x)x{x) = J d^x{xTef\0\x)\(t)o) (</>o|6»(x) Ix(t)) 

= (XrefI G(t,to) Ix(io)) (92) 

® The contribution of the term {h/2)Co of (4) vanishes in the first order considered 
here, since x® and xj are solutions of the free Dirac equation. But this property is 
limited to the first order. 
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where the space integral is over some detection volume. More generally one 
should consider a detection hypersurface cr(x) and the projection: 

[ = f da^ (xret | ^(a;) |</>o) 7/x (<^o| |xW) • (93) 

J <7 J (7 

In this paper, we shall limit ourselves to the calculation of the amplitude: 

(Xreflx(i)) = {Xre!\U{t,to) |x(io)) • (94) 

and take the phase of this complex amplitude as the phase contribution of the 
perturbing interaction. The resulting spinor (91) will also be derived using two 
different methods: first, in configuration space and second, in the momentum 
representation. 

5.1 Calculation in Configuration Space 

The evolution equation of the state vector in the interaction picture is 

lx(^)) = ddi{t) lx(^)) . (95) 

where the Hamiltonian Hi{t) is 

Hi{t) = J d^x 9 ^x)Vg{x)9{x) , (96) 

and where the operator Vg{x), acting on the field operator 9{x), is given to first 
order by (73) 

Vg = + ^9fc/ioj(7*7'^ - 7'^7^) 

+ *^7° [2/r°S° + dk (97) 

that we shall write: 

? /? 1 

Vo(i) = .4(.) + -d,B’(x) + ,hB>G)di = AG) + - {,nd„B’G)}^ (98) 

with: 

A{x) = ^mc^ 7 °/i°° + ^a’^^dkhoj 

B^x) = ^7° [2h°^x° + - h°°x^] ■ (99) 

From equation (91) we check that the evolution of the one-(anti)particle spinor 
is governed by the equation: 

i9BtX = -ihcx°x^djx + mc^x^X + VgX 



(100) 
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to which we may add terms corresponding to diagonal magnetic dipole and 
off-diagonal electric dipole interactions [10,12]. This equation has been used in 
references [10] and [12] to discuss all the terms that lead to a phase shift in an 
interferometer. 

To obtain the corresponding amplitude, we can start directly from the inte- 
gral form of (95): 

lx(i)) = Ix(io)) - d‘^x'0\x')VG{x')e{x') \x{t')) (101) 

So that, to first order: 

^ + i^B^{x)dj Xij 

f r 1 ^ 

rf^a;|xLf -^l^dt + h-V - -OL - h - V 

+ • V X i -h V • • a + • ft. x*| (102) 

which follows also from Gupta’s form. We have used the definitions: 

a = 7°7= ^),ft = {ftO'=},ft = {ft*^}. (103) 

The calculation of the spinor itself, by a stationary phase method in configuration 
space, is outlined in Appendix C. 

In equation (102), the first three terms lead to the familiar phase shifts of the 
Linet-Tourrenc formula [23], the fourth term is the spin-rotation interaction and 
the last two terms ensure hermiticity. But, because of the Dirac matrices, the 
interpretation of the various terms in configuration space is not so transparent 
and, in previous works non-relativistic limits have been taken either directly as 
in our own work [12] or through a Foldy-Wouthuysen transformation as in [13] 
in the case of inertial fields, to put equations (100) and (102) in a form where 
the significance of the terms is more obvious. 

In this paper, we may as well take advantage of the fiat Minkowski space-time 
and, therefore, we will use rather the momentum representation in the following. 
As we shall see, the interpretation of the terms is then much easier, even in their 
relativistic form. 

5.2 Calculation in the Momentum Representation 

The free-field operator 6 is written as before as: 

J Mp)x^i^Kx) + dl{p)x^~J (x) , 



(104) 
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where br{p) and dr{p) are the annihilation operators for the particles or antipar- 
ticles, respectively, and are the positive or negative energy solutions of the 
free Dirac equation given by (78) and (79). 

Let us introduce the Fourier transforms A{k,t), B^{k,t), h^^{k)-. 



A{x) = I d^k A{k, (105) 

B\x) = I d^k (106) 

= (2^ / (107) 

VG{x)e^^-^/^ = J d^k VG{k,p,t)e^'^-^e^^-^/^ (108) 

VG{k,p,t) = A{k,t) — B{k,t) ■ {p + ^kk) . (109) 



Let us expand x(x), Xref(a;) and Xi{x) in plane waves using the expansions of 
0(=*=)(a;) and For particles, the output spinor is: 

X{x) = {(l)o\0^+\x) |x(t)) 

= (0olXl / d^P^r{p)x^j^){x) Ix(t)) 

= (2^^ t J l*(0) (110) 



with |lp,r) = bl{p)\(f)o) and with, using again (101), 
(Xreflx(t)) = (XrefI tf(t,to) IXi) 

= (Xref IXi) ~ d'^x'xl^f{x')VG{x')Xi{x') 

i 1 



(111) 



= (Xref IXi) - ^ (27rfi)3 ^ J J J '^V(Xref |lp.r) 



E{p) 

I mc^ 



)t(p)p-i(P-m'-S(p)t')/'i L 



(27t)3/2 



d^kVG{k,p' 

(112) 

= (Xref IXi) - dt' j d^pj j^2^)3/2 ^^ref |lp+fifc,r) 



d^k 



y E{p+hk)^ E{p) 



-I- hk)VG{k,p, ^p) 



( 113 ) 
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and similar expressions for antiparticles. In many cases \xi) and |xref) can be 
conveniently taken as plane waves, but it is usually more interesting to consider 
wave packets. Replacing |xref) by |lp,r), we check that the momentum represen- 
tation \x{t)) of x(a^) satisfies: 




mc^ 

E{p — hk) 



— kk, t)u^'' \p — hk) 



This equation is, in momentum representation, the analogous of equation (100) 
in configuration space and we shall give its first-order solution later. It leads to 
a discrete set of coupled equations for a fixed or negligible recoil momentum. We 
illustrate below, in the case of the scattering amplitude, how the matrix element, 
which appears in the second member of (113) and (114) can be evaluated. 

To first order, the scattering amplitude (113) is: 



(X reflX^^H^)) 

= {Xref\U^'^\t,to)\Xi) 



“ r , r ' “'*0 



dt' / d^p 



d?k 



: (Xref I ^p+hk,r) ^ 



mc^ 



E{p+ hk) 



(27t)3/2 

+ hk)Va{k,p, t')u^'~ \p) 

^i[E(p+hk)-E{p)]t' 



mc^ 

E{p) 

(115) 



The next step takes benefit from the smallness of hk/mc or of hkc/E{p) to 
expand the various quantities in this expression to first order in these parameters. 
The energy E{p + hk) can be expanded in a Taylor series: 

E{p + hk) = E{p) + + ^2 E{p) + • ■ ■ = + hk-v + hS+ , 

(116) 



where S is the recoil shift. From the general transformation law of spinors in 
Lorentz boosts (see for example [26]): 



u{p) 




n • a sinh ( — 



(f)]»(0). 



(117) 



where n is the unit vector along p, and tanhi^ = |p|/( 7 toc), we derive the 
following infinitesimal transformation for spinors: 



, - , I , n - a , n - 6p 

u(p + op) = < 1 H tanh p 

2 p 

V 2 / p 



1 

+ 2 

• E 



sinh p 
u{p) 



a - Sp _ , n • Sp) 

(n-a) 



(118) 
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we have to first order in hk/mc: 

u^{p+ hk) = M^(p){l + . Q, _ 



” X “ X}, 



27 me 2(7 + 1) me me 



(119) 



where 7 = 1/a/1 — fc = fey + 7fej_, the indices || and _L designate vector 
parts respectively parallel and perpendicular to p. The term proportional to a 
represents a boost (velocity change) and the term proportional to S a, rotation 
(Thomas precession). 

Rather than to calculate directly 



y E{p+hk)^ E{p) 
it is simpler to calculate first the matrix element: 



+ hk)Va{k,p^ t)u^'' \p) 



(120) 



me^ 



me 



E{p + \hk) y E{p — \hk) 



j— — \hk)VG{k,p - \hk,t)u^'" \p- \kk) 



me 

W) 



me 



E{p) 

1 mc^ 



m'^’\p + \hk) \ A{k, t) — B{k,t) ■ pj m*-’’ 7 p — \hk) 
u^^'^^{p) [x 4 (fc, t) — B{k, t) ■ p] ^(p) 



( 121 ) 



47 E{p) 



I — • a, A{k, t) — B{k, t) ■ pi \p) 
( me j _ 



me 



4(7+1) E(p) 



,(r)t 



(P) 



— X — • B, A(k, t) — B{k,t) ■ p 
me me 






(P) , 
( 122 ) 



where {x 4 , B}, designate (anti)commutators. The first line gives: 



(p) U (fe, t) - B{k,t) ■ p\ u^’'">(p) 
E(p) L J 



E{p)hoo 



— X h) ■ + 'ycr\\)w^^ ^ 

= OF/ 5rr' - ^(fe X ^) • , (123) 

2 L(p) 47 

where are Pauli two-component spinors corresponding either to helicity 
eigenvalues or to the two values of the z-component of the spin in the rest 
frame, and where 



— eph 



2 E{p) 



p ■ h ■ p 



a = (cr_L + 7(T||) 



(124) 
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is the spatial part of the Thomas-Pauli-Lubanski 4-vector operator [30,31]. The 
second line gives the term: 



ihc^ 



-{k X h ■ p) ■ aw^'' ^ 



4E{p)j 

The last line gives the Thomas precession terms: 



(125) 



4(7+1) E{p) 
ihhoo 



* -i:, A(fc,t)-B(fe,t)-p[> m("')(p) 

' me me 



(k X p) ■ aw^'^ ^ ^ (fc X p) ■ ^ 

^ ’ 2m{-t+l)E{pY 



4to(7 + 1) 
ih 



{k X p) ■ h ■ p 



4E'^{p)m{j + 1) 

The last line can be rewritten to yield: 






(126) 



E{p + \hk) y E{p - \hk) 



u^'^^Yp + ^hk)VG{k,p — \hk, t)u^'" \p — \hk) 



2 V 2 

2E{pf "" 47 L 



k X [ h — h • 



pe 

E{p) 






ih 



2m(7 + 1) 



(fe X p) 



c^P^h^.^p'' 

2E^{p) 






(127) 



If we replace now p by p + fi.fc/2 in order to calculate (115) this introduces 
the additional terms: 



hc^k ■ P I r c^P ■ h ■ p \ hk I ~ h ■ pc 

E^p) -"T' 






(128) 



If we introduce the 4- vector k^: 



K^c = k ■ V 

K = k 



(129) 

(130) 



which corresponds to the energy-momentum 4-vector exchanged during the in- 
teraction, these terms can be rewritten: 



2E{p) 



„r 1 r 

-P^hf,^P + -^h^^p \ Sr 



( 131 ) 
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Our final result is thus, for the scattering amplitude: 

(Xretl Ix(^o)) 

d?k 



= --^ / dt' I (d^p) 

rr' 



' to 



(27t)3/2 

I — 

\2E{p) 



(Xret I ^p+hk^r) 



{p^ + Hk>^) h^^p'' i 1 - 



Hk°c 



ih 



2m(7 + 1) 

ihc 



(fe X p) 



2E{p)J 
2E^{p) 



\ <5™- 






47 



k X ( 

E{p)J 









(132) 



To obtain the outgoing spinor, one can replace (xrefi by ((/)o| 0(x) in the previous 
expression, which gives this spinor as a sum of outgoing plane-wave spinors: 



X{x) =X»(a:)- Lo_i ../9 






d^k 



(2^)3/2 V E{p) 









ih 



2m{x + 1) 

ihc 



{k X p) 



c^pf^hf.^p'' 






■ fcx 

47 [ V E{p)J\ 



2E^{p) 

\ E^^^PEEip-nk)^ 

(133) 



in which an explicit phase factor is associated with each outgoing plane wave 
component and which can also be obtained directly from the first-order solution 
of equation (114). 

One can also perform the calculation presented in Appendix D, which gives 
this spinor in the form of Dirac matrices multiplying the initial spinor plane 
wave components 



X(a^) = X*(a^) - 7 777^772 H f (d^p) 



d^k 






' to 



(27t)3/2 2E{p) 



{pP + hnP) h^^p'' ^1 - - y 



(134) 



^ik.x^i[E{p+hk)-E{p)]{t' -t)/h ^p^^t{p-x-E{p)t)/h^^^ 



In this formula, the integral over p can be calculated by assuming that the initial 
wave packet has a very narrow width in momentum space around a central value 
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Po- The initial wave packet Xii^) can then be factorized. If this approximation 
is not sufficient an expansion of the wave packet around po can be used [22]. 
The k integral can be performed by turning each term linear in k into a spatial 
derivative and the result of Appendix C is recovered for the spinor in config- 
uration space. Finally the time integral can be worked out in many cases and 
expresses energy conservation [22]. 

For the scattering amplitude, the comparison of equation (132) with equation 
(102) shows new terms directly related to the momentum exchange: a generalized 
Thomas precession and a generalized spin-gravitation interaction. To illustrate 
how this phase shift calculation is done from equation (132) we shall rewrite this 
equation without the terms that obviously do not contribute to the phase and use 
expression (116) for the energy difference, in which we neglect the recoil shift 6 



~ dt' J {d^p)a*^f{p)a,{p)/3*.^^f|3r',^l 2g( + vt',t')p’'Sr 

/p p/ ^0 ^ c / 

h \c'^pf^Vhf,^{xo + vt',t')p'^ 1 (^).|. (p) 

'X — 7 TV , xp ’ 13! 

2to(7-|-1) [ 2E^{p) \ 

V X ^h{xo + vt' , t') - h{xQ + vt' , t') ■ ^ , 



where we also made explicit the centers of the wave packets and their polariza- 
tion: 

(Ipv lx^) = (x,ef| lp+nk,r) ^ (136) 

(the assumption that the wave packet is broad enough for a*^^{p+hk) ~ a*^^{p) 
has been made). 

If we assume, for simplicity, that the reference wave packet is identical to 
the unperturbed wave packet: (p)/3r.^ref = oti{p)j3r,i and that they are very 

narrow in momentum space around a central value p, the phase simplifies to: 

7 ' c^pf^Vhfj,^{xQ + vt',t')p'' 
m( 7 -|-l) _ 2E'^{p) 

^ + h{xo + vt',t ')- -sj 




(137) 
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where s is the mean spin vector® 

s = /2j (139) 

r,r' 

In fact the phase calculation is usually more involved since the previous for- 
mula applies only to the case of straight unperturbed trajectories. In practice 
however, one cannot always ignore the fact that, when calculating the phase 
to first-order for a given term of the Hamiltonian, the motion of the particles 
is affected by other terms. One example is the calculation of the gravitational 
shift within the atom beam splitters, in which one cannot ignore the important 
effects of the diffracting electromagnetic field on the trajectories of the particles 
[10,38-40]. Gravitational phase shifts have to be calculated along these trajecto- 
ries. Another example is the gravity field itself, which, on earth, gives parabolic 
trajectories for atoms. The phase shift for the other terms in equation (137) has 
to be calculated along these parabolas. A convenient way to achieve these calcu- 
lations is to replace Xq + vt' and v in equation (137) by the classical trajectory 
{a;(t'), u(t')} obtained in the ABCD formalism developed in references [22,37]. 

Expression (137) displays all the terms which may lead to a gravitational 
phase shift in a matter-wave interferometer. They are summarized in Table 1 
where one finds successively: 

• the terms involving /iqo lead to the gravitational shift {hoo = —‘2 g ■ rjc^), 
to shifts involving higher derivatives of the gravitational potential and to 
the analog of the Thomas precession (spin-orbit coupling corrected by the 
Thomas factor). 

• the terms which involve h = give the Sagnac effect in a rotating 

frame (h — xr/c), the spin-rotation coupling and a relativistic correction 
(analogous to the Thomas term for /iqo)- They describe also the Lense- 
Thirring effects coming from inertial frame-dragging by a massive rotating 
body, which is a source for h. 

• the other terms, which involve the tensor h = {fi*®} describe genuine Gen- 
eral Relativity effects such as the effect of gravitational waves and de Sitter 
geodetic precession (which also includes the Thomas term for hoo). 

Our expressions are valid for spins 0 and 1/2 and may be conjectured to be 
valid for arbitrary spin if cr/2 is replaced by the corresponding spin operator S. 

The reader will find calculations of the phases corresponding to these vari- 
ous terms in references [3,7,10,35,41-44]. In these calculations, one should never 

® More generally, if we use ai{p,r) instead of ai{p)(3r,i, the mean spin vector should 

be written: 

s = {Xi\^ j {d^x)e\x)Se{x)\xi) = ^ j {d^p)at{p,r)ai{p,r')hw^^'>^aw‘'^^/2y 



(138) 
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forget that the external field acts not only on the atoms but also on other 
components of the experiments, such as mirrors and laser beams and that, de- 
pending on the chosen gauge, additional contributions may enter in the final 
expression of the phase which should, of course, be gauge independent. 

In present experiments on the earth gravity measurements, the relative sensi- 
tivity is 8gjg ~ 3.10“® after 60 seconds and the absolute accuracy 5.10“®[6,35]. 
For rotations, the best sensitivity achieved up to now is 6.10“^®rad.s“^Hz“^/®[36] 
but these numbers are expected to improve rapidly in the near future, espe- 
cially in space experiments, in which general relativistic effects should become 
detectable. An accurate measurement of the effect of gravitation and inertia on 
antimatter also appears as a possibility already discussed in reference [47] with a 
transmission-grating interferometer, although we believe that an antiatom inter- 
ferometer using laser beams for the antihydrogen beam splitters (Ramsey-Borde 
interferometers) would be better suited for such an experiment. The formalism 
introduced in this paper to deal with antiatoms should be useful to discuss such 
experiments, especially when coherent beams of antihydrogen will be produced 
either by Bose-Einstein condensation and/or by stimulated bosonic amplifica- 
tion. 

5.3 Analogy with the Electromagnetic Interaction 

The formulas that we have derived for the transition amplitude and for the 
ongoing spinor strongly suggest analogies with the electromagnetic field case. 
To emphasize these analogies, let us introduce the following pseudo-potential 
4- vector 



A>^ = , 


(143) 


then the Linet-Tourrenc term, which appears also in the generalized Thomas 
precession is simply 




(144) 


where is the 4-velocity p^/m. 

The corresponding field 




^ ^1/ = i = (E/ c, E) 


(145) 


More rigorously, one should introduce [33] 

A'" = + fi-Ki^/2) , 


(140) 



which stems directly from a compact form of the interaction Hamiltonian 
Vg = + h.c. = I 

with p® = —a’pj + 'f' rac and pj = ihdj 



(141) 

(142) 
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appears in the outgoing spinor (135) through 

and in the generalized spin-gravitation interaction term 

hc^ 



ihc 

47 



k X ( 

E{p)J 






2jE{p) 



B . (146) 



This new correspondence between the gravitational interaction and the electro- 
magnetic interaction generalizes the so-called gravitoelectric and gravitomag- 
netic interactions introduced by de Witt [45] and Papini [46] . 

Appendix A: Dirac Equation in Curved Space— Time 

For a given space-time manifold, together with its metric tensor the La- 
grangian density of the Dirac field reads [14] 



r ^ 



E — (147) 



with 



= 



ot — ^oc 






, = 






(148) 



In these formulas the ea’s are four-vector fields constituting an orthonormal 
tetrad (tetrad indices have a hat), that is satisfying the relations 



Hpi/ — Va/3j 



(149) 



with (rjag) = diag{+l, -1,-1, —1). A change of tetrad is possible according to 
the formula 

= (150) 

where A is a matrix of the Lorentz group defined at each point of the spacetime 
manifold. In such a change the Lagrangian density (147) is left invariant if the 
Dirac field is correspondingly transformed according to the law 



E' = S{A)-^E, 



(151) 



the matrix ^(A) being the usual transformation matrix under a Lorentz trans- 
formation of a Dirac spinor [17]. For an infinitesimal transformation 



AqlP VolP “t“ ^OL 01 



^ocf3 — ^ (3o 



(152) 



one has 



S'(A) 1 = J -H -e„/3Cr“^. 



(153) 
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The field equations read 

— W = 0 , S' —i^al^^ — Tnc/h =0. (154) 

With the invariance of C under the phase transformations S' — >■ — >■ 

is associated the current density 

f = j^ = ce% (155) 

where is a four-vector invariant under a change of tetrad, and one has the 
conservation relation in either one of the two equivalent forms 

V = 0, V^J^ = 0. (156) 

Appendix B: Weak-Field Approximation 

It is now assumed that space-time admits a coordinate system (x^) in which the 
metric tensor takes the form 

9/11^ ~ Vfii^ 4” ^ 1* (15"f) 

According to that hypothesis, the /i^i/’s will be considered as first-order quan- 
tities, and the subsequent calculations will be valid at this order. To determine 
the corresponding form of the Lagrangian, it suffices to construct the tetrads 
associated with (157). By putting = rjan + fafj., where /q^ is of the first 
order, one obtains from (149) and (157), 

fa/3 4” f /Sa — (158) 

The general solution of these equations is of the form 

fa0 — 2^aj3 4“ ^ol( 3 i (159) 

in which the Cq-^j’s are first-order quantities only restricted by the antisymmetry 
condition Cap = —ipa- One then has 

^ctfi Vafi 4- 2^0^11 4- (159) 

Introducing the free Dirac Lagrangian (2) and the associated stress-energy tensor 
(3), the Lagrangian density (147) calculated at the first order and corresponding 
to (160) reads^^ 

£ = £o - - y 

(161) 



11 



The indices of and are raised with the help of . 
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The corresponding field equations are the following 

0 = [(1 + ^ - mc/h) - 

+ '-{d^h - d^h'' , (162) 

0 = - mc/h)(\ + \h) + 

-\{d^h - d,h\)r - . (163) 

The terms depending on are identical to those appearing in (5) and (6). The 
same is true for the current density, which is now 

f = (164) 

Considered independently of the context, the equations (162) and (163) are 
invariant under the transformations of the Poincare group provided that the 
and the e^j^’s are transformed like the components of a second-rank tensor, 
and W by the corresponding transformation law of a spinor. However, the weak- 
field character of is not conserved by any finite Lorentz transformation except 
by the rotations. Moreover, the ft-^i/’s being the basic quantities, the condition 
that the e^j^’s are of first order is naturally interpreted by assuming that these 
latter are some linear functions of the former, that is one has 

= afn.pahP'^ . (165) 

Such a relation is not compatible with the general tensor transformation law, 
on account of the symmetric or antisymmetric character of or e^^,, but can 
be made compatible with the rotations by a suitable choice of the coefficients 

^liupa • 

The rotational invariance of (165) is equivalent to the relations 

^mT'pV' ^ r/ r/ ^ (166) 

where R can be any rotation matrix. The corresponding solutions are given by^^ 

P^0^0J ^ ^ ^0^Jk ^ Q ^ ^JkO^ ^ ^ ^ijkl ^ Q ^ 

where A and B are some arbitrary parameters. If, in addition, the invariance 
under parity is postulated, one has to take B = 0, giving 

6oi — Cjo — Ah^i , Cij — 0 , (168) 

then, instead of (160), 

^QO ^ 2 ^00 ’ ^Oi (2 ’ ^0 ^ 2 2 * 

(169) 



As usual, the latin indices can take the values 1,2 or 3, and is the completely 
antisymmetric symbol. 
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In particular the choice A = 0 corresponds to the tetrad 



— 2 ^ 



2 ' ^OifJ, 5 



(170) 



which we call the standard tetrad associated with h^,y. The corresponding La- 
grangian is identical to the Lagrangian (1), therefore the equations (5) and (6) 
are recovered from (162) and (163) by letting = 0. We will continue to 
designate the corresponding field by W. 

If we write explicitly the spinorial connection in this weak-field approximation 









(171) 



the equation for d/ can also be written in a simple form, analogous to the elec- 
tromagnetic case, and which will find an interpretation in terms of gauge fields 
in the flat space-time approach of the main text 

-^W = 0. (172) 

Another choice of tetrad can be made in relation with the expression of the 
vector current. From (155) written in the weak-fleld case, one has in general 



and then 



jA'/c ={ 1 + 



f/c ={1+ 



(173) 

(174) 



This quantity will be proportional to the usual density if one has = 

— which is obtained with the choice A= Denoting by 'P' the correspond- 
ing held, one has^^ „ , . „ 

/ = c(l + . (175) 

It is then possible to introduce the new held 0 by [12] 



such that 



0=(1 + U^)5tf/''~(1+U^)r, 



f = c©7°0, 



(176) 

(177) 



as in the free-fleld case. The change of held ^ ^ P' corresponds to a change 
of tetrad as defined above, so that, from (160) and (170), one has the formulas 
(151), (152) and (153) written with an infinitesimal parameter e equal to — e 
that is such that = 0, £oi = ~£io = This gives 

<P' = {I - (178) 



then, from (176), 



0=(/+i/i-iV7V)tf'- 



(179) 



The held 0 is identical to the held introduced in Section 3.2 for the purpose of 
defining the quantization and the interaction picture. 

Let us note that, in the expression of the conserved charge corresponding to (175), 
the volume element (1 -I- i)d^x is, in the first-order approximation, the 3-volume 
associated with the spatial metric defined from the coordinate system (x'^)[20]. 
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Appendix C: A Stationary Phase Calculation 



The outgoing spinor can be calculated directly in configuration space with the 
help of a stationary phase formula. In the particle case, from the definition 



X{x) = {(l>o\0^^\x)\x{t)) 



(180) 



and the integral equation 



\x{t)) = Ix(to)) {d^x')e^+'>Hx')VG{x')e^+\x')\x{t')) (isi) 

one can derive, to first order, the expression 

x{x) = Xi(x) - ^ y {d'^x'){(l)o\e^+'>{x)e(+'>{x')\(t)o)j°VG{x'){(t)o\0^^\x')\xi), 

(182) 

which, with the help of standard formulas, can be transformed into 
X(x) = Xi{x) - ^ f d‘^x'S‘^^\x - x')x°VG{x')xi{x') 



(183) 



= Xi{x)- J ^^P^ptr(x) J {d‘^x')xitJ{x')x°VG{x')xi{x'). (184) 

By introducing (85), one finds 

X{x) = Xi{x)~ / d^PX^j^Kx) ihc{x^+)\U^'^\t,to)\xi)- (185) 

In this last expression the matrix element can be submitted to the same trans- 
formations as those leading from (86) to (90), yielding 

X{x) = x^{x)-iY^ j d^px^+}{x) f d'^x' ^/r^^(x')Xp’!'r (a^O {ifM. ~ ^7m) Xi{x') 
= X^{x)-^^ d'^x'hf^^{x')S’'+\x-x')(jJ^^-'¥^X^^Xi{x')- (186) 



By introducing the expression 
S^^\x — x') = 



d?p 



(7^p^ -h 



2{27ThfJ E{p) 
one finally gets for a plane wave with the momentum 



\po^-\-E{p)/c 



, (187) 



rt 



x(x) = X^ix) - J dt' j d^x'h>^''{x') X 



1 



{27rhrJ E{p) 



+ + mc)e-*^'("-"')/y^Xi(^') ■ (188) 




434 



Ch.J. Borde, J.-C. Houard, and A. Karasiewicz 



The elementary stationary phase formula^^ applied successively to the inte- 
gral over p, then to the integral over x', yields the Linet-Tourrenc result [23], 
namely : 



X{x) 



1 



POfiPou 

2h E{p^) 






'to 






Xi{x) , 



Vo 



Pqc^ 
E{po) ' 
(189) 



This expression is the beginning of an asymptotic expansion in powers of h of 
the form 



X{x) 



dt'F{x-vo{t-t'),t') 

to 



Xi{x), 



(190) 



in which F is defined by an expansion in integer powers of h whose first term 
corresponds to the Linet-Tourrenc formula. This expansion can be obtained from 
a generalization of the stationary phase formula given by Hormander [25]. The 
application of this general formula to the two integrals appearing in (188) yields, 
after some complicated calculations, the following expression of the outgoing 
spinor (to be derived in the next Appendix by a simpler method) 



F = 



Pfj-Piy 






ih 



F{p) 2F{p) 
F{p) 



+ pAh^" - Pm- 



F{p) 



{p^p.hA 

c c ' 



(191) 



J J p=po 



Appendix D: Derivation of the Wave Function 
Using the Momentum Representation 

The method used for the amplitude in the main text can be used to derive the 
wave function. The spinorial wave function for one-(anti)particle states is 

X{x) = {A 0{x) \x{t)) . (192) 



For particles: 



(0o| 6»(+^(x) \x(t)) = (A 9^~^\x) Ixi) 

i 1 f ,3 f mc^ 

~h{27:hA^Jt/ j ( 27 t ) 3/2 F{p + hkf 



(193) 

(p + hk)A''\p + hk) 



VG{k,p,t')F'^-^eAip+f^’^)-E{pW-t) 




)(p)es(p— s(p)*)(lp_,, 



lx(0) 



14 



See, for instance [24]. 
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and a similar formula for antiparticles. To first order we get 



(194) 



i 1 






^ / dt' / 



d^k 



me 



(27t)3/2 E{p+hk) 



(p + {p+ hk) 



V ^\p) 

The next idea is to express the propagator which appears in this equation in 
order to write the outgoing spinor in the form of Dirac matrices multiplying the 
initial spinor plane wave components. 

From 



1 



^ u'^''\p)u^''\p) = + me] 



one obtains 



E 



-u^‘^\p + hk)u^'''’\p + hk) = 



(195) 



E{p + hk) 



(Pm + + mc][l- 



hn^c \ 



2E{p) ' ' ^ 

from which, after some algebra, one finds the spinor 



E{p)) 



(196) 



X^^\x) = - ^ 



i 1 



h (2^ft)3/2 E J j (2^)3/2 2E{p) 



hhbe \ ih 



{p^ + hKb h^.p’' ^ j - ^-EnpaP^h^^p^^ 



(197) 



^ik.x^i[E(p+hk)-E(p)]{t'-t)/h ^(r') ^p'^p,iip-x-E(p)t)/h^^ 

V E{p) 

where the time-dependent exponential can be expanded to any desired order for 
recoil shift corrections: 



^i[E{p-\-hk) — E{p)]{t' — t) /h 



^ik-v{t' —t) 






mp) 

Finally one can check that: 

(XrefI Ix(^o)) = [ d^x xLf(a^)x^^^(a;) • 



(198) 

~ [1 + i5{t' - t)] . 



(199) 
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We obtain indeed 






\ [E{p+hk) — E(p)]t' 



y{ (Xrefl lp+?ifc,r) 



E{p + hk) 

{p^ + hn^) h^i.p'' ( 1 
with 

^ mc^ c- 



2E{p) 



Hk°c\ ih ~ 



> mc^ 
E{p) 



(200) 

W^’’'^P)(lp.r-' \Xi) , 



E{p+ hk) 



uP^’{p + hk) 



2E{p) 



~ / fiK^r\ h ~ 

{p^ + hK^) hf.^p"' f 1 - - i^KpaP’'h^^p^ 






E{p) 



2E{p) 



+ hn^) hp^p’' 1 - 



hK°c 






ih 



•u;Mtay;(d) _ 



ihc 

47 



k X \ h — h 



2E{p) J 2m(7 + 1) 

pc 



{k X p) 



( 201 ) 

c^P^hp^p'' 

2E^{p) 



E{p) 






from which equation (132) is recovered. 
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Abstract. In these two talks, we report on the efforts to probe the role of spin and 
polarization in gravitation. After reviewing the motivation and historical background, 
we focus the talks on the experimental searches. These experimental searches are 
mainly of two categories: (i) laboratory searches (torsion-balance experiments, mag- 
netic resonance experiments, SQUID experiments), and (ii) astrophysical and cosmo- 
logical searches (pulsar observations, radio-galaxy observations, gamma-ray observa- 
tions). We first discuss experimental searches for photon polarization coupling and then 
discuss experimental searches for electron spin-coupling. In the discussion of photon 
polarization coupling, we review the astrophysical and cosmological electromagnetic 
propagation observations. In the discussion of electron spin-coupling, we review the 
weak equivalence principle experiments, the finite-range spin coupling experiments, 
the spin-spin coupling experiments and the cosmic-spin coupling experiments. We dis- 
cuss two recent laboratory experiments, a SQUID experiment and a torsion-balance 
experiment in detail to illustrate the experimental techniques. The ultimate searches 
for the role of spin in gravitation is to measure the gyrogravitational ratio. A discussion 
of the strategies to perform such experiments conclude these two talks. 



1 Introduction 

According to our present understanding of physics, particles and fields transform 
appropriately under inhomogeneous Lorentz transformations. These inhomoge- 
neous Lorentz transformations form a group called the Poincare group. The 
only invariants characterizing irreducible representations of the Poincare group 
are mass and spin (or helicity in the case of zero mass). Both electroweak and 
strong interactions are strongly spin-dependent. The question comes whether the 
gravitational interaction is spin-dependent (polarization-dependent). In this pa- 
per, we review the searches for the role of spin in gravitation. The gravitational 
interaction is the earliest formulated interaction. Both Newtonian gravitation 
and Einstein’s general relativity are universal-interaction theories about masses. 
There are no (direct) polarization-dependent effects in these theories. Histori- 
cally, these theories were formulated before spin was discovered. Ever since the 
existence of spin (quantum spin) was noticed, people started to propose possible 
polarization-dependent effects in gravitation on various levels. If there are spin- 
dependent effects in gravitation, Einstein’s Equivalence Principle (EEP) would 
be violated at a certain appropriate level. Since mass and spin (helicity) are 
two independent invariants of the Poincare group, there is the question whether 
the gravitational interaction between masses and the “gravitational” interac- 
tion between spins have the same coupling constant. If the strengths of coupling 
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are different, then the question comes whether we shall call the spin-spin or 
spin-mass interaction gravitational. This question can only be answered if the 
strengths are determined and a working theory (e.g., superstring theory) is for- 
mulated and adopted. From a phenomenological approach, we ask whether there 
is a long-range (or semi-long-range) spin interaction (in addition to electromag- 
netic spin interaction) and what is its strength and interaction form. Therefore, 
in reviewing the experimental searches, we include the related efforts. 

In Section 2, we review the theoretical relations among the equivalence prin- 
ciples and the way leading to the axial interaction. In section 3, we discuss 
spin and gravitation emphasizing torsion and gyrogravitational ratio. In Section 
4, we articulate on the inter-relations among spin, equivalence principle and 
(semi-)long-range forces. In Section 5, we review the experimental searches for 
photon polarization coupling and related tests of EEP. In Section 6, we review 
the experimental searches for electron spin-coupling; we discuss weak equiva- 
lence principle experiments, finite-range spin-coupling experiments, spin-spin 
coupling experiments and cosmic spin-coupling experiments in four subsections. 
In Section 7, we present an outlook. 



2 WEP, EEP and the Axial Interaction 

Equivalence principles [1,2] are cornerstones in the foundation of gravitation the- 
ories. In the theoretical study of the foundation problems, to what extent the 
Galileo weak equivalence principle {Universality of free-fall trajectories, (WEP 
I)} implies the validity of the Einstein equivalence principle (EEP) is an im- 
portant issue. Schiff [3] conjectured that the Galileo weak equivalence principle 
implies the Einstein equivalence principle. In 1972, I started to investigate this 
issue and reached a counterexample of Schiff ’s conjecture [4]. In order to find 
out to what extent the violation occurs, I followed up using a general framework 
— the X ~ 9 framework to study Schiff’s conjecture and theoretical relations of 
various equivalence principles [5,6]. 

The X ~ 9 framework can be summarized in the following interaction La- 
grangian density 

Li = - Akf{-gY/'^ - T’/TO/'^(5(x - x/) (1) 

IGtt at 

where = — x^^^* is a tensor density of the graviational fields (e.g., 

gij,(t>, etc.) or fields to be investigated and j^, F^ = Aj^i — Aij have the usual 
meaning. The gravitation constitutive tensor density dictates the behavior 
of electromagnetism in a gravitational field and has 21 independent components 
in general. For a metric theory (when EEP holds), js determined completely 
by the metric g''^ and equals {—gY^‘^{\g^^gA — \g'^^g^^). 

We proved that for a system whose Lagrangian density is given by Eq. (1), 
WEP I holds if and only if 

Xbfci = (_^)l/2 [ lg^kgn _ yUgk, ^ ^ ^ 



(2) 
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where (/) is a scalar function of the gravitational fields or fields to be investigated, 
and with 



{ 1, if {ijkl) is an even permutation of (0123) 

— 1, if \ijkl) is an odd permutation of (0123), (3) 

0, otherwise. 

If ()) yf 0 in (2), the gravitational coupling to electromagnetism is not minimal 
and EEP is violated. Hence WEP I does not imply EEP and Schiff’s conjecture 
is incorrect [4-6]. However, WEP I does constrain the 21 degrees of freedom of 
X to only one degree of freedom (<()), and Schiff’s conjecture is largely right in 
spirit. 

The theory with (/> yf 0 is an axial theory with important astrophysical and 
cosmological consequences (Section 5). This is an example that investigations 
in fundamental physical laws lead to implications in cosmology. Investigations 
of CP problems in high energy physics leads to a theory with a similar piece of 
Lagrangian with (p the axion field [7-12] (Section 5.2). 

In the nonmetric theory with (</> yf 0) given by Eq. (2) [4-6], there are 
anomalous torques on electromagnetic-energy-polarized bodies so that different 
test bodies will change their rotation state differently, like magnets in magnetic 
fields. Since the motion of a macroscopic test body is determined not only by 
its trajectory but also by its rotation state, the motion of polarized test bodies 
will not be the same. We, therefore, have proposed the following stronger weak 
equivalence principle (WEP II) to be tested by experiments, which states that 
in a gravitational field, the motion of a test body with a given initial motion 
state is independent of its internal structure and composition (universality of 
freefall motion) [5,6]. Therefore, in this framework, the imposition of WEP II 
guarantees that EEP is valid. 

Before mid-seventies, the actual weak equivalence experiments were per- 
formed on unpolarized bodies. These experiments constrained only 2 degrees 
of freedom of x- Only when experiments are performed on polarized bodies with 
various electromagnetic energy configurations, can they constrain the other 18 
degrees of freedom. This situation motivated us to study other existing and 
potential experimental and observational evidences for EEP, and to perform 
experiments on polarized bodies and to search for spin-dependent forces. 

Our efforts to test equivalence principle for polarized bodies [13] and to 
search for spin-dependent forces [14-17] led us to search for axion-like forces also 
[18,19]. We discuss these searches in Section 6. Since the electromagnetic energy 
contents of laboratory polarized-bodies and unpolarized-bodies are small, other 
experimental and observational evidences are crucial in laying the foundation for 
the Einstein equivalence principle. In Section 5, we discuss the constraints from 
these evidences and how to generalize the x~9 framework to give a more general 
framework for testing the foundation of relativistic gravity including microscopic 
phenomena. 
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3 Spin and Gravitation 

Ten years after the discovery of general relativity, in 1925-26, Goudsmit and 
Uhlenbeck [20] introduced our present concept of electron spin as the culmina- 
tion of a series of studies of doublet and triplet structures in spectra. From the 
very beginning of its discovery, spin remains a microscopic object. One way to 
incorporate spin into the classical general relativity is to treat the aggregate of 
spins as ordinary angular momentum. This is a standard way to extend general 
relativity. 

However, as we know, for the electromagnetic interaction, the gyromagnetic 
ratios of elementary particles are different from one, and these ratios reveal the 
inner electromagnetic structures of elementary particles. What would be the 
gyrogravitatinal ratios of elementary particles? If they differ from one, they will 
definitely reveal the inner gravitational structures of elementary particles. These 
will give clues to the microscopic origin of gravity. 

The Stanford Orbiting Gyro Relativity (GP-B) experiment to be launched 
in 2001 is a very precise and difficult experiment [21]. It aims at detecting the 
Lense-Thirring effect. When this is done, and some of the rotating gyros are 
replaced by spin-polarized bodies, the gyrogravitational ratio will have a chance 
to be measured [22]. Microscopic experiments, when developed, may contribute 
to this very difficult task too (Section 7). 

Theoretically, since a spin 1/2 particle is the most fundamental object in the 
consideration of quantum spin, we look into its inertia effects [23] and curva- 
ture effects of a Dirac particle in the standard theory of gravitation as extended 
by Gartan, Sciama, Kibble, and Hehl et al. [24-29]. In 1921, Eddington [30] 
mentioned the notion of an asymmetric affine connection in discussing possible 
extensions of general relativity. In 1922, Gartan [24] introduced torsion as the 
antisymmetric part of an asymmetric affine connection and laid the foundation 
of this generalized geometry. Gartan [25] proposed that the torsion of spacetime 
might be connected with the intrinsic angular momentum of matter. Follow- 
ing this idea, Sciama [27,28] and Kibble [26] developed a theory of gravitation 
which is commonly called the Einstein-Gartan-Sciama-Kibble theory. Hehl et 
al. [29] extended it to a dynamical torsion theory. The inertia effects include the 
Bonse-Wroblewski phase shift due to acceleration, the Sagnac-type effect, the 
rotation-spin effect, the redshift of the kinetic energy, and the inertial spin-orbit 
coupling [23]. The torsion effects are analyzed in [31,32] and references therein. 
The curvature effects give the gyrogravitational ratio. 

The study of this problem from more basic points of view involves two ap- 
proaches: 

The first approach starts with the connection. After the formulation of gauge 
theory by Yang and Mills in 1954, many efforts have been made to bring the 
gravitation into the present gauge-theoretic framework. Yang [33] proposed his 
gravitational equation in 1974 along this line. However, there are spurious so- 
lutions [34] and the metric is postulated instead of derived. Affine connections 
correspond to gauge potentials. To be truly analogous to the present gauge 
theories, the metric ought to be derivable from the affine connection and the 
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equations of motion. To pursue this approach further, we first obtain the neces- 
sary and sufficient conditions for the existence of a metric in an affine manifold 
[35] . Now the problem comes as how to transform these conditions into equations 
of motion derivable from a variational principle. Ashtekar’s formulation [36] of 
general relativity is an approach in this general direction. 

The second approach starts from an extended framework for Dirac particle 
in a gravitational field and explores various possible relations between spin and 
gravity [37,38]. 

Metric-affine theories treat metric and affine connection more or less on equal 
footing. For recent research results, see [39,40] and references therein. 

4 Spin, Equivalence Principle, and Long-Range Forces 

The equivalence principle is an important cornerstone of universal gravitation. 
The precision of its validity puts an important constraint on gravitation theories 
and particle theories. Possible deviation from equivalence would give a clue to 
the microscopic origin of gravity or some new fundamental forces(s). In relation 
to spins, we look into polarization-dependent deviations from equivalence. In 
this respect, experiments with polarized entities play an especially important 
role. 

Particles and fields transform appropriately under inhomogeneous Lorentz 
transformations which form the Poincare group. The only invariants charac- 
terizing irreducible representations of the Poincare group are mass and spin (or 
helicity in the case of zero mass) . Gravitational interaction is a long-range mass- 
mass interaction. From a phenomenological approach, we ask whether there is a 
long-range (or semi-long-range) spin-mass or spin-spin interaction and what is 
its strength and form of interaction. Experiments on macroscopic spin-polarized 
bodies are sensitive tools to detect and study these possible interactions to a 
good precision. 

In the new general relativity of Hayashi and Shirafuji [41], the coupling with 
an antisymmetric field leads to a universal spin-spin interaction. From gauging 
a sub-group of the Lorentz group, Naik and Pradhan [42,43] proposed a sim- 
ilar interaction. Around 1980, the particle physics community began to realize 
the possible existence of Goldstone bosons and/or pseudo-Goldstone bosons. 
These bosons generate (semi-)long-range forces of monopole-monopole type, 
monopole-dipole (spin) type, and dipole-dipole (spin-spin) type, just like the 
new general relativity of Hayashi and Shirafuji [41]. The axion [7-12] is such a 
pseudo-Goldstone boson. The recent issue of the fifth force arises from the ex- 
istence of a semi-long-range coupling to baryon number/hyper-charge/lepton 
number. Attempts have been made to construct models of long-range forces in 
higher dimensional Kaluza-Klein type theories and superstring theories. An at- 
tempt to generalize Nambu-Goldstone mechanism shows that the restoration of 
a spontaneous violation of a “fact”, such as the “fact” that translation generators 
in different directions commute, implies the existence of a massless excitation 
and, therefore, a long-range force. All the above cases can be explored exper- 
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imentally by gravitation-type experiments on macroscopic bodies-E6tv6s-type 
experiments, Galileo-type (“free-fall”) experiments and cavendish-type experi- 
ments [13]. 



5 Experimental Searches for Photon Polarization 
Coupling and Tests of EEP 



In the X ~ 9 framework, for a weak gravitational field. 



( 4 ) 



where 






( 5 ) 



with 77b the Minkowski metric and all |xb)| l. In this field the dispersion 
relation for oj for a plane-wave propagating in the z-direction is 

w± = k{l + J[(Xi + K 2 ) ± V(^fi-i^2)2 + 4if2]} (6) 

where 

= ^( 1)1010 _ 2 ^( 1)1013 ^^( 1 ) 1313 ^ 

K2 = yb)2020 _ 2^(1)2023 ^(1)2323^ 

^ ^ ^( 1)1020 _ ^( 1)1023 _ ^( 1)1320 ^( 1 ) 1323 ^ 



Photons with two different polarizations propagate with different speeds V± = 
LO±jk and would split in 4-dimensional spacetime. The conditions for no splitting 
(no retardation) is = w_, i.e., 

Ki=K2, K = 0. (8) 

Eq. (8) gives two constraints on the [44-46]. 

The condition for no splitting (no retardation) in all directions gives ten 
constraints on the With these ten constraints, x can be written in the 

following form 



where H = det (H^-) [44-46]. With the null-birefringence observations of pulsar 
pulses and micropulses before 1980, these relations are empirically verified to 
IQ-14 _ I0“i6 [44-46]. With the present pulsar observations, these limits would 
be improved. Analyzing the data from polarization measurements of extragalac- 
tic radio sources, Haugan and Kauffmann [47] inferred that the resolution for 
null-birefringence is 0.02 cylce at 5 GHz. This corresponds to a time resolution 
of 4 xlO“^^ s. With a detailed analysis and more extragalactic radio observa- 
tions, (9) would be tested down to 10“^® — 10“^® at cosmological distances. The 
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electromagnetic propagation in Moffat’s nonsymmetric gravitational theory fits 
the X ~ 9 framework. Krisher [48] and Haugan-Kauffmann [47] have used the 
pulsar data and extragalactic radio observations to constrain it. The effect of <j) 
in (9) is to change the phase of two different circular polarizations in gravitation 
field and gives polarization rotation for linearly polarized light [4,49,50]. Using 
polarization observations of radio galaxies, Carroll, Field and Jackiw [49,50] put 
a limit of 0.1 on A(f> over cosmological distances. Using a different analysis of 
polarization observation of radio galaxies, Nodland and Ralston [51] found indi- 
cation of anisotropy in electromagnetic propagation over cosmological distances 
with a birefringence scale of order 10^® m (i.e. about 0. 1-0.2 Hubble distance). 
This gives A(f> ~ 5 - 10 over Hubble distance. Later analyses [52-56] did not 
confirm this result and put a limit of A(j) < 1 over cosmological distance scale. 
Further observations to test and measure A(p to 10“® is promising. The natural 
coupling strength (j) is of order 1. However, the isotropy of our observable uni- 
verse to 10“® may leads to a change of Acf) over cosmological distance scale 10“® 
smaller. Hence, observations to test and measure A(j) to 10“® is very significant. 

Eq. (9) is verified empirically to high accuracy from pulsar observations and 
from polarization measurements of extragalactic radio sources, and we can now 
look into the empirical constraints for H®1 and ip. In Eq. (1), ds is the line element 
determined from the metric gij. From Eq. (9), the gravitational coupling to 
electromagnetism is determined by the metric Hij and two scalar field (j) and 'ip. 
If Hij is not proportional to gij , then the hyperfine levels of the lithium atom, the 
beryllium atom, the mercury atom and other atoms will have additional shifts. 
But this is not observed to high accuracy in Hughes-Drever experiments [57,58]. 
Therefore Hij is proportional to gij to a certain accuracy. Since a change of 77** 
to ATJd does not affect in Eq. (9), we can define Hu = gu to remove this 
scale freedom [44,59]. 

In Hughes-Drever experiments [57,58] Am/m < 0.5 x 10“^® or Am/mf, „i, < 
0.3 X 10“^^ where me.m. is the electromagnetic binding energy. Using Eq. (9) in 
Eq. (1), we have three kinds of contributions to Z\m/me.m.- These three kinds are 
of the order of (i) {H^^^-g^j,^), (ii) {Hof_c~goii)v, and (iii) (77oo-goo)v^ respectively 
[44,59]. Here the Greek indices g,, v denote space indices. Considering the motion 
of laboratories from earth rotation, in the solar system and in our galaxy, we can 
set limits on various components of {Hij — gij) from Hughes-Drever experiments 
as follows: 

10-1® 

\Hoi^-goi,\/U < 10-13-10-14, 

|i?oo-5ool/C/ < 10-1°. (10) 

where U (~ 10-®) is the galactical gravitational potential. 

Eotvos-Dicke [60-63] experiments are performed on unpolarized test bodies. 
In essence, these experiments show that unpolarized electric and magnetic ener- 
gies follow the same trajectories as other forms of energy to a certain accuracy. 
The constraints on Eq. (9) are 



|1-V’|/U< 10-1° 



( 11 ) 
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and 

|i?oo -<700 I/C/ < 10-6 (12) 

where U is the solar gravitational potential at the earth. 

In 1976, Vessot and Levine [64] used an atomic hydrogen maser clock in a 
space probe to test and confirm the metric gravitational redshift to an accuracy 
of 1.4x10-^ [65]. The space probe attained an altitude of 10,000 km above the 
earth’s surface. With Eq. (11), the constraint on Eq. (9) is 

|//oo-<7oo|/C/< 1.4x 10-^ (13) 

Thus, we see that for the constraint on |iLoo — 9 oo\/U, Hughes-Drever experi- 
ments give the most stringent limit. However, STEP mission concept [66] pro- 
poses to improve the WEP experiment by five orders of magnitude. This will 
again lead in precision in determining Hgo. 

The theory (1) with yb** given by (2) is studied in [4] and [22]. In (1), 
particles considered have charges but no spin. To include spin-1 particles, we 
can add the Lagrangian for Dirac particles. In the next section, we review the 
experimental tests of the equivalence principle for polarized bodies. 

In the above discussions, we assume in (4) to be given by the special 

relativistic value (5). In general, is determined from cosmological model 

in a particular theory and provides a framework to test special relativity. From 
null birefringence of pulsar observations, y(°)*/** is constrained to have the value 
given in (5) to a precision of lO”^®. From the polarization measurements of 
extragalactic radio sources, the agreement to special relativity is to lO”^®. 

To include QCD and other gauge interactions, we have generalized the x~ 9 
framework [68]. Now we are working on a more comprehensive generalization 
to include a framework to test special relativity, and a framework to test the 
gravitational interactions of scalar particles and particles with spins together 
with gauge fields [38]. The relation of this generalized framework with respect 
to the Mansouri-Sexl framework [69], the Tourrenc-Melliti-Bosredon framework 
[70] and Lammerzahl framework [37] is under study. 

6 Experimental Searches for Electron Spin— Coupling 

In this section, we review and discuss electron spin-coupling experiments — 
weak equivalence principle experiments, finite-range spin-coupling experiments, 
spin-spin coupling experiments and cosmic spin-coupling experiments. An im- 
portant issue is to make a spin-polarized body. Here we describe the strategy 
and method. 

To make a polarized-body with a net spin but without net magnetic moment, 
we need both the orbital angular momentum contribution and spin contribution 
of magnetic moment so that these contributions cancel each other, with a net 
total spin remaining. 

For iron-group transition elements in a crystal, the elements are exposed 
to a noncentral electric field. In a noncentral field the plane of the orbit will 




Spin in Gravity 447 



move about; the angular momentum components are no longer constant and 
may average to zero (quenched). For iron-group elements the orbital magnetic 
moments are mostly quenched although spins may drag some orbital momenta 
along with them. 

For rare-earth elements the orbital angular momenta in the unfilled 4 f shell 
is not quenched. Therefore, rare-earth Fe, Co, Ni compounds would be good 
materials for making spin-polarized bodies. 

The experiments would be much easier to do at room temperature than at 
low temperature. For light rare-earth elements and their compounds, the Curie 
temperatures or Neel temperatures are quite low. Heavy rare-earth compounds, 
such as Tb, Dy, Ho compounds generally have much higher Curie temperatures. 
DyeFe23 has compensation temperature near room temperature. 

In the Dy-Fe compounds, magnetization versus temperature curves indicate 
artiferromagnetic interaction between iron and dysprosium atoms. From the sus- 
ceptibility data, the strengths of Fe-Fe, Dy-Fe and Dy-Dy exchange interactions 
can be derived. Fe-Fe exchange interaction dominates the others. Dy-Fe com- 
pounds are ferrimagnetic at room temperature. The effective ordering of the 
iron lattice and dysprosium lattice have different temperature dependence be- 
cause the strengths of exchange interactions are different. At the compensation 
temperature, the magnetic moments of two lattices compensate each other so 
that there is no net magnetization. Dy+++ has L = 5 and S' = 5/2 . Half of the 
Dy magnetization comes from orbital angular momentum, the other half from 
spin. Most of the iron magnetization comes from spin. So there is a net spin (and 
net total angular momentum) remaining. 

To make samples, DyeFe23 was synthesized by melting stoichiometric quanti- 
ties of metallic iron and metallic dysprosium. The DyeFe23 ingots were crushed, 
pressed into a cylindrical aluminum cup, and magnetized along the axial or a 
transverse direction direction. The magnetic field was shielded by two halves of 
pure iron casing, a thin aluminum spacer, and two sets of two fitting /i-metal 
cups with another thin aluminum spacer between the two sets. 

Measuring the magnetization-temperature curve of our sample DygFe23, 
comparing with temperature dependence curve in the literature and calculat- 
ing from the magnetic properties of DygFe23, there is at least 0.4 net polarized 
electron per atom. 

In addtion to the DygFe23 samples, we also make DyFes, HoFes, HoeFe23 
and TbeFe23 samples. 

6.1 Weak Equivalence Principle Experiments 

To investigate the equivalence principle for spin-polarized bodies or to probe 
the mass-spin (monopole-spin/baryon-number-to-spin) interactions, we have 
used both a beam balance [ 71 ] and a torsion balance [ 13 ] to test a magnetically 
shielded spin-polarized body of DygFe23. From these results, we have inferred 
that, to an accuracy of 5 x 10 “^, the polarized electron falls at the same rate 
as unpolarized bodies in the earth’s gravitational field, and that it falls at the 
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Fig. 1. Limits on cr-f spin coupling for axionlike interactions from various experiments. 



same rate as unpolarized bodies in the solar gravitational field with a deviation 
from this unity ratio estimated at (3 ± 4) x 10“^. 

At the present, we are working on a rotatable torsion-balance experiment and 
have improved the solar equivalence principle test by one order of magnitude [72] . 



6.2 Finite Range Spin Coupling Experiments 

In [73], we use torsion balance with two cylindrical copper test masses and two 
cylindrical polarized “attracting” masses to search for finite-range mass-spin 
interactions with the Hamiltonian of the form: i7i„t = f{'>')cr ■ r which includes 
the finite-range Leitner-Okubo-Hari Dass interaction with /(r) = —Ae~^^mU. 
This result shows that for the range of 3-5 cm, the upper limit of this interaction 
for our test mass and polarized mass is below 1% of the gravitational interaction. 

Ritter, Winkler and Gillies [74] use spin-polarized DysFe 23 masses acting on 
unpolarized copper masses in a dynamic-mode torsion pendulum, and searched 
for interaction of the axion [7-12,75] form 



Sninc \Xr 




e • r . 



(14) 



In (14), A is the range of the interaction, gs and gp are the coupling con- 
stants of vertices at the polarized and unpolarized particles and m is the mass of 
the polarized particle. Constraints on the coupling gsgp/hc with respect to the 
range from various experiments are plotted in a logarithmic plot (Fig. 1). For 
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A < 0.3 m, references 73 and 74 give more stringent constraints than references 
76 and 77, and for A > 0.3 m, vice versa. References 76 and 77 investigate the 
existence of hypothetical anomalous spin-dependent forces by sensing the inter- 
action of polarized trapped ions with fermions in the earth. These experiments 
are more sensitive to longer range forces, while experiments with laboratory 
sources are more sensitive to shorter range forces. Reference 8 has the best limit 
for 0.1 m < A < 8 m. Our works [19,73] have the best limit for A < 0.1 m. 
In figure, we also show the proposed sensitivities of the STEP spin-coupling 
experiment [79,66] and the AXEL spin-coupling experiment (AXial Experiment 
at Low temperature) [80,81,18] together with allowed region of present axion 
models. 

Speake’s group at the University of Birmingham is working on the develop- 
ment of a new superconducting torsion balance to detect force on the mass for 
the spin-coupling experiment. H.J. Paik at the University of Maryland proposes 
to use superconducting accelerometers for a spin-coupling experiment with a 
high Q. They are also aiming at very significant improvement. 

In Fig. 1, there are magnetic resonance experiments, torsion balance experi- 
ments and SQUID experiments. In the following, we give a taste of experimental 
procedure using a SQUID experiment [19]. The experiment measures the effec- 
tive BeS field produced by hypothetical axion or axion-like interaction while 
magnetic field is shielded by two niobium superconducting shields. Equation 
(14) can be written in the form 



TLint — • .Beff — ' B, 



(15) 



with /ie = — I /ie I the magnetic moment of the electron. Hence in this experi- 
ment, we sensitivly measure Beg field given by 



Bes = --^^ + 

IJ^eOTTnieC \Ar J 



(16) 



The scheme of our experimental setup is shown in Fig. 2. Our copper mass is 
sitting on one side of the turntable underneath the dewar. In the data taking, a 
laser beam and a chopper-photodetector systm is used to lock the output signal 
of the dc SQUID to the rotation angle of the polarized bodies. The laser beam 
is intercepted by the chopper when the copper axis is in line with the axis of the 
paramagnetic salt. We define this angle to be zero degree, and expect the cr ■ r 
interaction signal to be proportional to cos 0, where 0 is the angular position 
of the copper mass. 

To start the measurement, we set the turntable with copper mass rotating at 
0.96 cycle per second with a stepping motor system. The stability of the rotation 
speed is better than 10“^. The output of voltage of the dc SQUID system for 1 (j)o 
from the most sensitive scale of the dc SQUID controller is 10 V. This output is 
further amplified 1000 times and low-pass filtered to 10 Hz bandwidth, and then 
read into a computer with an analog to digital converter. The angular position 
of the copper mass is simultaneously read into this computer. The typical noise 
of the SQUID output after 1000 times amplification and 10 Hz low pass filtering 
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Fig. 2. Schematic for spin-coupling experiment. 



as recorded by ADC (Analog-to-Digital Converter) is about ±300 mV. This 
is consistent with the dc SQUID noise 200 mV/\/Hz after amplification. When 
we average the data for 400 cycles, the typical output is about ±50 mV and 
the pattern repeats. To subtract this interference background, we average the 
data for 4-5 hours, alternatively take away and put back the copper cylinder to 
average the data for another 4-5 hours and subtract the results to find the net 
effects. 

The weighted average of the six runs for the amplitude of cos 0 component is 
(0.49±2.34) mV. Expressed in terms of flux amplitude, it becomes (0.49±2.34)x 
lO”"^ </)o- Converted to BeS, we have (1.13 ± 5.38) x 10“^^ Gauss and the cou- 
pling constant gagp/fx is (0.14±0.67)x 10“^® for A > 30mm. Our experimental 
constraint on the coupling constant gsgp/hc improves over previous results by 2 
orders of magnitude at A = 30 mm. Further improvement will be implemented. 
For finite-range Leitner-Okubo-Hari Dass interaction, the dimensionless param- 
eter A is constrained to less than 10 for the range parameter A = > 30 mm. 

6.3 Spin— Spin Coupling Experiments 

Usually the dipole-monopole (spin-mass) part of an interaction is larger than 
its dipole-dipole (spin-spin) part. Monopole-monopole part is sometimes larger 
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if there is no constraint. However the monopole-monopole part does not change 
with polarization and is usually harder to detect. Therefore in search for a new 
interaction, searching for dipole-monopole part is usually more significant. This 
is true for axion search. However, for axial photon [42,43] search and arion [82] 
search, the search for anomalous spin-spin interactions give stringent constraints. 

Let as be the strength of the anomalous spin-spin interaction compared to 
the magnetic spin-spin interaction. The pioneer work of Graham and Newman 
[83] used carefully prepared hybrid split toroids of GdNis/NdNi at supercon- 
ducting temperatures with a torsion balance of the feedback deflection type. 
Their experimental result assigns uncertainties in two parts: statistical at the 1 
a level and an estimated systematic uncertainty = (8.0 ± 6.3 ± 1.1) x 10“^^. 
The torsion balance experiment led by Ritter at the University of Virginia uses 
the period method and gives the constraint ag < 1.5 x 10“^^ [84]. Adapting 
the induced ferromagnetism method of Vorobyov and Gitarts [85] , we use a low 
noise dc SQUID system to search for the interaction of spins in a spin-polarized 
test mass and those in a paramagnetic salt, separated by a /i-metal shield and a 
double-layer superconducting shield [15-17]. Our result limit the strength of Ug 
to ag = (1.2±2.0) X lO”^"* [17]. This limits the coupling of axial photon and the 
arion coupling to a level much lower than originally proposed. We are currently 
working on an improvement of this experiment. 

6.4 Cosmic Spin Coupling Experiments 

Hughes-Drever experiments [57,58] test the Cosmic Spatial Isotropy for spin 3/2 
particle very precisely. Recently, frequency and clock experiments push this limit 
even further. 

As to the spin 1/2 particle, Phillips [86] used a cryogenic torsion pendulum 
carrying a transversely polarized magnet with superconducting shields to set a 
stringent limit of 8.5xlO“^®eV for the splitting of the spin states of an electron 
at rest on Earth. In our laboratory we have used a room-temperature torsion 
balance with a magnetically-compensated polarized-body and set a spin energy 
level splitting limit of 3 x 10“^®eV [87,88]. Berglund et al. [89] use a magnetic 
resonance technique and set a limit of 1.8 x 10“^® eV on the energy splitting. 

For the analysis of cosmic anisotropy for electrons, we can use the following 
Hamiltonian: 

Hcosmic = Ciai + C 2 U 2 + (17) 

in the cosmic frame of reference. This includes the following two cases, (i) 
77cosmic = gcr ■ n where n is a cosmic vector which defines a preferred direc- 
tion, with Cl = gni, C2 = gu2, C3 = gn^ as considered in the references [87,88]; 
here G’s are constants, (ii) iLcosmic = go' ■ v where v is the velocity with re- 
spect to the comic background defined by the isotropy of background radiation, 
with Cl = gvi, C2 = gv2, C3 = gv^ as considered in the context of references 
[90,91]; in this case, since v is largely the velocity of our solar system through 
the cosmic preferred frame, to a first approximation, G’s are also constants. For 
convenience, we use the celestial equatorial coordinate system from the center 
of earth for our laboratory position, 
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i.e., the earth rotation axis (North pole 
direction) as z-axis and the direction 
of the spring equinox as the positive 
x-direction. All the above experimen- 
tal constraints are on Ci and C 2 . The 
constraints on C 3 are crude. 

To improve the precision and to 
constrain on C 3 , we use a rotatable 
torsion balance carrying a transversely 
spin-polarized ferrimagnetic DygFe 23 
mass. With a rotation period of one 
hour, the period of anisotropy signal 
is reduced from one sidereal day by 
about 24 times, and hence the 1/f 
noise is greatly reduced. Our present 
experimental results constrain the cos- 
mic anisotropy constant Ci, C 2 , O 3 to 
+ Cl < (1.8 ± 5.3) X 10-21 eV 
and I C3 |< (1.2 ±3.5) x lO"!® eV. 

This improves the previous limits on 
(Cl, C 2 ) by 120 times and C 3 by a fac- 
tor of 800. Our experimental set-up is 
schematically shown in Fig. 3. 

The angular velocity of the cosmic 
signals is f2+LO, fi—oj, and oj. By the earth rotation the projection of the electron 
spin in the x — y plane rotates to opposite direction relative to the neutrino 
background or cosmos after half of a sidereal day (11 hours 58 min 2 seconds). 
Adding the two data sets separated by half sidereal day, we can eliminate the 
fi+ui, f2—w term, and estimate C 3 . Subtracting between the same two data sets, 
we can eliminate the u term. With 4 sequential data sets (each set’s separated by 
half sidereal day) in opposite rotational direction of rotatable table, the signals 
with frequencies 12 ± w, Q — u, iv can be separated. The results of eight such 
sets of runs gives the limits on Ci, C 2 , C 3 just mentioned. This experiment also 
gives a stringent CPT test [94] . 

7 Outlook 

We have reviewed and discussed theoretical motivations and experimental sear- 
ches for the role of spin and polarization in gravity. Efforts in this direction 
will bear fruits both in fundamental physics and cosmology. For example, the 
study of equivalence principles leads to polarization effects in cosmology to be 
tested by astrophysical observations, and experiments on spin-polarized bodies 
constitute strong CPT tests. 

The ultimate searches for the role of spin in gravity is to measure the gyro- 
gravitational ratios of particles. For these searches difficult experiments are to be 




Fig. 3. Experimental setup for spatial 
isotropy test. 
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performed. As we have already seen, the difficulties of the Stanford Gyroscope 
experiment to be launched in 2002 and the technological achievement already 
made are very great. However, there is a chance of success by using the following 
method: 

(i) Polarized-body method [22], 

(ii) Atom interferometry [95], 

(iii) Superfluid ^He [96]. 

Let us look into the future. 
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Spin is the ultimate gyroscope. The smallest possible amount of angular 
momentum is - that possessed by a spin ^ particle. When the day comes 
that it becomes realistic to theorise about and to measure the precession of a 
spin i particle it will be necessary to have to hand the relevant theoretical tools; 
in other words, to be able to give a description of spin one-half particles which is 
consistent with Special Relativity, and to generalise that description to General 
Relativity. In general terms, then, this is an exercise in relativistic quantum 
mechanics, and in the case of General Relativity, in quantum mechanics in a 
curved space. This latter is, of course, different from quantum gravity. Quantum 
gravity is a theory describing the quantum nature of the gravitational field itself, 
for example in terms of gravitons. For our purposes the gravitational field is 
treated classically, as a curved space-time. The only thing to be quantised is the 
spin i particle. 

It may be thought that this exercise has already been performed, since the 
Dirac equation is nothing other than a relativistic equation for spin | particles. 
It turns out, however, that the Dirac equation itself does not automatically 
yield a relativistic spin operator. The problems connected with finding such 
an operator were already identified in 1950 by Foldy and Wouthuysen. After 
outlining these problems, we shall describe how this operator is constructed. 
The paper concludes with some remarks about the extent to which it makes 
sense to talk about spin in the context of general relativity. In particular it will 
be pointed out that spin precession is inevitable in GR; there is no such thing as 
conserved spin in curved spacetime. 

1 The Dirac Equation 

The Dirac equation is a first order wave equation for spin one-half particles. In 
the usual notation, with a metric (+, — , — , — ) and c = 1 , it is 

E + ^ ■ p)tp = rmlj (1) 

To satisfy the Einstein (mass-shell) condition — p ■ p = it is found that 
the coefficients 7 ^ cannot be ordinary numbers, but must be matices - in fact, 
4x4 matrices. In the “standard representation” (see for example [1]) 

7° = (J , 7' = (0 -a* 0) (2) 
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where each of the entries above is a 2 x 2 matrix and cr* are the Pauli matrices. 
Since the 7 s are 4x4 matrices, the “wave function” ^|) must have 4 components. 
We may write 



V' = 




( 3 ) 



where the “Pauli spinors” u and v each have two components. 

Dirac was setting out to find a wave equation for the electron; that is, for 
a spin i particle. It is reasonable, therefore, to ask what there is in the above 
to convince us that equation (1) does indeed describe a particle with spin A 
spin i particle has two spin components and the generators of the group SU(2) 
(the covering group of SO(3)) are proportional to the Pauli matrices; we have 

(4) 



It is, of course, very well known that the reason ip above has 4 components 
rather than 2 is that the Dirac equation predicts the existence of particles and 
antiparticles together, and they each have spin It is, then, a simple matter to 
define , . 



r* = 



CT* 0\ 

0 crV ’ 



( 5 ) 



and I A* will clearly also obey the commutation relations (4) above. Moreover, 
the Dirac equation is a relativistic wave equation so the proposition immediately 
suggests itself that is in fact the relativistic spin operator. This proposition 
runs into a problem, however. The Dirac Hamiltonian is 



H = 7°(m + (6) 

' ^ \cr ■ p —m J 

and the problem is that [S^,H] ^ 0; spin, given by the operator (5), would not 
be a conserved quantity. It turns out that the Dirac equation suffers from another 
problem, which turns out to be related to this one; or, at least, the solution to the 
one problem is also the solution to the other. This problem is that the velocity 
operator v = dr/dt has eigenvalues ±1, i.e. ±c, so Dirac particles should travel 
at the speed of light. This is clearly inconsistent with relativity. 

These problems were addressed and solved by Foldy and Wouthuysen almost 
half a century ago [2] . Their strategy was to find a transformation of the wave 
function ip, ip' = under which the new Hamiltonian 

H' = , (7) 

is block-diagonal. Equivalently, it should contain no terms which coupled the 
two-spinors u and v (see equation (3)) - in the words of FW, no “odd” terms. 
The required transformation turns out to be 

^ E + m + ij-p 
y/2E{E + m) ’ 



(8) 
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and the new Hamiltonian is 

TTf ^ ( \Zrn^ +p‘^ 0 \ 

\ 0 \Jtv? + p^J ’ 



(9) 



a sensible result indeed! Under the FW transformation a general operator A 
becomes A' = so the new spin operator is 



X = 



„-iS 



■P 



+ X P 



2E{E + m) 2E 



(10) 



This highly non-linear operator is called the “mean spin operator” by Foldy and 
Wouthuysen. It commutes with the Hamiltonian (6) and therefore corresponds 
to a conserved quantity. When acting on positive energy states (those for which 
<T ■ p = j^E — the mean spin operator becomes 



V 1 0 P 

-^FW=yCry - , r 

2 2[E + m) 



7 ^( 1 + 7 °). 



( 11 ) 



Historically the Foldy- Wouthuysen transformation has been regarded as an in- 
trinsically non-relativistic procedure - as the “correct” way of taking the non- 
relativistic limit of the Dirac equation. This would seem to be the philosophy of 
the treatment by Bjorken and Drell [1], for instance, as well as by Hehl and Ni 
[3]. What I shall show, however, is that the FW operator above, when acting on 
positive energy states, is in fact the relativistic spin operator for spin 1 particles. 



2 Spin and the Poincare Group 

In non-relativistic quantum mechanics spin is consistently treated as being as- 
sociated with rotations, and therefore, in group theoretic language, to be ap- 
proached by analysing the rotation group. This is SO (3) in the simplest ap- 
proach; but SO(3) does not have representations giving spin To treat these 
particles one has to observe that the covering group of SO(3) is SU(2), and SU(2) 
has a two-component complex spinor ( “Pauli spinor” ) as the basis state for its 
fundamental representation. In the non-relativistic domain, then, a satisfactory 
understanding of spin emerges if one views it as described by the covering group 
of the rotation group. 

On entering the relativistic domain, however, a fresh look has to be taken. It 
was Wigner who took the freshest look at this problem and whose 1939 paper [4] 
(1965) gives us the profoundest understanding of spin. Wigner pointed out that 
what is involved is not the homogeneous, but - surprisingly - the inhomogeneous 
Lorentz group, or Poincare group; that is, the group of rotations, Lorentz “boost” 
transformations and translations in spacetime. This group is the isometry group 
of the Minkowski metric - though Wigner did not describe it in these terms. The 
surprising thing is that translations should be relevant to the task of describing 
spin, but Wigner’s argument, paraphrased rather drastically, was that spin was 
the additional “label” necessary to identify quantum states after their mass had 
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already been specified. Mass and spin are the two quantities by which states can 
be specified, and they are both kinematic in origin. Mass (rest-mass, that is) is 
given by the Einstein relation, mentioned above (with c = 1) 

M^ = P^P'' = e2-P-P. (12) 

E and P are, however, the generators of translations in time and space and it 
is because of this that it is the Poincare group, rather than the homogeneous 
Lorentz group, which is the group of significance in this problem. In group theo- 
retical language P^P^ is the first Casimir operator of the Poincare group; that is, 
a quadratic function of the group generators which commutes with all of them. 
The Poincare group is, however, a group of rank 2 so there should be two Casimir 
operators. If the first one gives mass, which it clearly does by equation (12), the 
second one should yield spin. Wigner was unable to find an explicit expression 
for the second Casimir operator, but he did pioneering work on elucidating the 
group structure of the problem. The crucial step to take was to observe that the 
group which described spin was the little group of the Poincare group; that is, 
the subgroup of it which leaves invariant a given 4-vector P^. Wigner showed 
that this subgroup depends on M^. States are classified as timelike, spacelike or 
lightlike according as > 0, < 0, or = 0. There is also the case where P^ = 0, 
which is the null case. It is only in the case of timelike states that the little group 
is the rotation group; in the other cases it is non-compact. This immediately 
explains, for example, why the photon does not have a longitudinal polarisation 
state, for that would only exist if spin were desribed by SO(3), which it is not 
for lightlike states. The problem of identifying specific spin operators, however, 
was left unsolved by Wigner. 

It is clear that a necessary property of the spin operators is that they should 
commute with the translation operators P^^ of the Poincare group. The other 
6 generators, however, usually denoted do not commute with P^. (It may 
be useful to remark what a “disappointment” this is, since the “obvious” spin 
operators would be Joi, (i = 1,2,3), which generate SU(2).) A crucial step 
forward, however, was made by Pauli, who defined [5] the operator 

J^"P" , (13) 

generally known as the Pauli-Lubanski pseudovector. It is, of course, a 4-vector, 
but because of the epsilon symbol only the intrinsic spin part of Jp^, and not 
the orbital part (which contains Pp), will contribute to Wp, so Wp clearly has 
something to do with spin. In addition. 



W = WpW>^ (14) 

commutes with all ten generators of the Poincare group, and is therefore the 
second Casimir operator of this group. For timelike states with mass M, W = 
M'^s{s + 1), where s is the spin, so it is clear that M and W are the invariants 
labelling mass and spin. The question still remains, however: what are the spin 
operators, which generate, for timelike states, the group SU(2)? 
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This question was all but answered by Giirsey [6] in 1965; the present author 
did the mopping up recently [7]. First the commutator 

= (15) 

is defined, as well as its dual 

(16) 

The combinations 

X^, = -i{W^, + iWp (17) 

Y^, = -i - iw;,) = tx;, ( 18) 

(19) 



may be shown to obey the commutation relations 






and similarly for Y. These are the same in form as the commutation relations for 
the angular momentum operators J^i, which generate the homogeneous Lorentz 
group, and it therefore follows, by familiar arguments, that the two sets of op- 
erators 

Xi — —Cijj^Xjf ^ , Yi — (^1) 

obey the SU(2) commutation relations 

[X^,Xj\ = eijkXk , = eijfeYfc • (22) 



They therefore generate two SU(2) groups, and they are both relativistic spin 
operators. In fact they are the operators appropriate for the left-handed and 
right-handed states, so that the spin operator appropriate for Dirac particles is 
given by 

— l5)^i+ + lb)^i ■ (23) 



Substituting the relations 

11^0 = ^crp, 



W. = yna. + 



P^ 



m) 



(j ■ p 



(24) 



into the above equations, we find 



Z = 




P 

2{E + m) 



75(1 + 7°) 



(25) 



which, on comparison with equation (11), is the same as the Foldy-Wouthuysen 
operator Xpw when operating on positive energy states. We have therefore 
shown that a relativistic spin operator can be defined for Dirac particles. The 
starting point for its discovery is the realisation that the group of motion, or 
isometry group, of Minkowski space, is the Poincare group; the spin operator is 
defined in terms of the generators of this group. 
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3 General Relativity 

The concluding remarks of the last paragraph make clear that in a general Rie- 
mannian space, since there is no such simple isometry group as the Poincare 
group, there is also no spin operator; spin, that is to say, will not be conserved 
in a curved space-time. It is therefore inevitable that spin precession will take 
place in a gravitational field. Relativists are, of course, quite familiar with this 
fact, and understand well the differing contributing factors to spin precession; 
Thomas precession, geodetic precession and Lense-Thirring precession. The pur- 
pose of the observation above is to place these facts in a somewhat more funda- 
mental context, by drawing attention to the connection between the existence 
of a (conserved) spin operator and that of an isometry group of the metric. 

It is interesting to note finally that there is one space besides Minkowski 
space with a ten parameter isometry group, and that is de Sitter space, a space 
of constant curvature which, in the Euclideanised version, can be regarded as 
S^, a hypersurface embedded in 5-dimensional Euclidean space. The “radius” of 
this may be denoted R and the limit i? — >■ oo corresponds to the transition to 
Minkowski space. The de Sitter group, like the Poincare group, has rank 2 and 
the associated two Casimir operators may be defined [8]. It turns out, hardly 
surprisingly, that in the limit i? — >■ oo these two Casimir operators become, 
in essence, mass and spin, the Casimir operators of the Poincare group. But 
this is only in the limit i? — ^ oo; in de Sitter space proper, the operator which is 
conserved does not correspond with pure spin. And in any case, as just remarked, 
de Sitter space is a very special case. In a general Riemannian space no conserved 
spin operator may be defined. If the curvature of the space is small - which in 
astronomical environments such as our own it is - then spin, as defined with 
reference to the Poincare group, is only approximately conserved. 



References 

1. J.D. Bjorken and S.D. Drell: Relativistic Quantum Mechanics (McGraw-Hill, New 
York 1964). 

2. L.L. Foldy and S.A. Wouthuysen: Phys. Rev. 78, 29 (1950). 

3. F.W. Hehl and W-T. Ni: Phys. Rev. D 42, 2045 (1990). 

4. E.P. Wigner: Ann. Math. 40, 149 (1939). 

5. W. Pauli: Ergebnisse der Exakten Naturwissenschaften 37, 85 (1965). 

6. F. Giirsey: Phys. Lett. 14, 330 (1965). 

7. L.H. Ryder: Gen. Rel. Grav. 31, 775 (1999). 

8. F. Giirsey: Introduction to the De Sitter group, in F. Giirsey (ed). Group Theoretical 
Goncepts and Methods in Elementary Particle Physics (Gordon and Breach, 1964). 




Testing the Dirac Equation 



Claus LammerzahP and Christian J. Borde^ 

^ Fakultat fiir Physik, Universitat Konstanz, 78457 Konstanz, Germany 
^ Laboratoire de Physique des Lasers, Institut Galilee 
Universite Paris 13, 93430 Villetaneuse, France 



Abstract. The dynamical equations which are basic for the description of the dy- 
namics of quantum felds in arbitrary space-time geometries, can be derived from the 
requirements of a unique deterministic evolution of the quantum fields, the superposi- 
tion principle, a finite propagation speed, and probability conservation. We suggest and 
describe observations and experiments which are able to test the unique deterministic 
evolution and analyze given experimental data from which restrictions of anomalous 
terms violating this basic principle can be concluded. One important point is, that such 
anomalous terms are predicted from loop gravity as well as from string theories. Most 
accurate data can be obtained from future astrophysical observations. Also, laboratory 
tests like spectroscopy give constraints on the anomalous terms. 



1 Introduction 

Experimental Quantum Gravity, the experimental search for deviations from 
Einstein’s General Relativity, which includes also Special Relativity, has been an 
very active area in physics since a few years. All unifying theories or quantum 
gravity theories predict small modifications from General Relativity, for example 
deviations from the 1 / r-potential, and violation of the equivalence principle, see 
e.g. [1], violation of Lorentz-invariance and violations of the universality of the 
gravitational red shift. Violations of these basic principles underlying General 
Relativity go together with a modification of the equations of motion for test 
matter in the gravitational field. 

Questions about the structure of the field equations for quantum objects came 
up very recently in the context of quantum gravity: From loop gravity [2] as well 
as from string theory [3] there are predictions about quantum gravity induced 
modifications of the field equations governing the motion of spin-^-particles. 
Up to now the predictions consist in directional derivatives and higher-order 
spatial derivatives added to the usual Dirac equation. However, more general 
modifications can also be expected. For example, the prediction of second order 
spatial derivatives, which is worked out in a distinguished frame of reference, 
makes it reasonable to expect also second order time derivatives. Therefore, it is 
important to study experimental consequences of general non-standard modifi- 
cations of the Dirac equation. In order to have a guiding principle at hand which 
tells us something about the meaning and the physical consequences of certain 
modifications of the usual Dirac equation, we will study these modifications in 
the frame of a constructive axiomatic derivation of the Dirac equation. 
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In that approach it is possible to derive the Dirac equation from a very few 
fundamental operational principles which can be tested directly in experiments. 
The derivation of the Dirac equation can be divided into two parts: The first part 
consists in the derivation of a system of linear hyperbolic field equations, which 
we call a generalized Dirac equation, from four fundamental principles which the 
quantum field has to obey, namely (i) the unique deterministic evolution, (ii) the 
superposition principle, (iii) finite propagation speed, and (iv) conservation of 
probability: 

unique deterministic evolution 
superposition principle 
finite propagation speed 
conservation law 

Here 7“ (a = 0,... ,3) are some matrices which are not assumed to fulfill a 
Clifford algebra. Also M is a matrix. 

The usual Dirac equation where the matrices 7“ fulfill a Clifford algebra and 
where M is proportional to the unit matrix, can be derived from the additional 
assumptions (v) uniquenes of the null cones, (vi) two helicity states only, and (vii) 
uniqueness of the mass shell, compare [4,5]. From these demands we arrive at 
a Dirac equation in Riemann-Cartan space-time, where the coupling to torsion 
consists in the axial part only. All these principles are operational since they can 
be proven directly by experiments. 

Some basic features of quantum theory mentioned above have been ques- 
tioned previously and subsequently been tested or estimated from some existing 
data on atoms, for example. One of these features is the linearity of quantum 
theory which is basic in our understanding of all quantum phenomena. A gener- 
alized quantum field equation including a non-linear term has been introduced 
and discussed by e.g. Shimony [6], and Weinberg [7], and references therein. Shi- 
mony himself proposed a neutron interferometry experiment which subsequently 
was performed by Shull et al [8] giving a strong restriction on the strength of 
a hypothetical nonlinear term in the Schrodinger equation. Also spectral data 
of the hydrogen atom have been used for this purpose. Another features which 
has been discussed was the conservation of probability [9]. The second part of 
assumptions (v) to (vii) manifests itself in a breaking of local Lorentz- and local 
position invariance which can be tested by Hughes-Drever, red-shift and atomic 
interferometry experiments [10]. 

In this paper we want to question the first of the four basic principles (i) to 
(iv) underlying our basic understanding of quantum theory, namely the unique 
deterministic evolution. Here, unique deterministic evolution means that if a 
quantum state is prepared at a time to> then the state is uniquely deter- 
mined for times t > Iq. This implies that the evolution of quantum states is 
described by an evolution equation which contains a first time derivative only, 
that is, -^4’ = where A is some operator. 

In order to test this principle, we propose a generalization of the usual Dirac 
equation by adding a second time derivative which violates this principle. This 



J generalized Dirac-equation 
(0 = i"f°'{x)da>p{x) — M{x)(p{x) 
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modified Dirac equation is used to calculate modifications of the propagation 
of spin- ^-particles as well as the corresponding modifications of the atomic 
spectrum (The hydrogen spectrum can also be used as justification of the four- 
dimensionality of space-time on atomic scales [11]). Up to now, all the experi- 
mental results are well explained using the standard theory, that is, the usual 
Dirac equation together with quantum corrections. If everything is well explained 
using the standard theory, then the modifications of the results due to the modi- 
fications in the Dirac equation, can be only smaller than the experimental error. 
Therefore, all the modifications can be restricted by comparing the calculated 
effects with the accuracy for the various measured effects. - However, future ob- 
servation of neutrinos and high energy photons from gamma ray bursts (GRB) 
may be capable to distinguish between the various models of the Dirac equation. 
Nevertheless, one has to keep in mind, that on cosmological distances the pa- 
rameters may depend on the position and thus the effect we are looking for may 
be cancelled during the propagation over long distances. Therefore, these obser- 
vations have to be complemented by laboratory experiments like spectroscopy. 



2 The Model: A Modification of the Dirac Equation 

The unique deterministic evolution implies that the evolution equation for the 
quantum field is of first order in the time derivative. This means especially, that 
the evolution equation is an equation without memory. In terms of a system of 
partial differential equations this means that this system should be of first order 
in time as it is the case for the Dirac equation ihdtip = —ihca. ■'Vip + If 

an evolution possesses a memory, the time derivative has to be replaced by an 
operator, for example, an integral expression: Bip = —ihca ■ V'i/'T Pmc^ip with, 
for example, Bij: = f B{t,t')ip{t')dt' . In the case that the kernel B{t,t') of that 
kind of equation possesses certain properties (it should be analytic), then one can 
expand that kernel resulting in a system of partial differential equations with an 
infinite sum of terms containing time derivatives of arbitrary order, 

Therefore, if the quantum field does not evolve uniquely deterministic or if 
quantum theory has a memory, then the resulting field equation in these cases 
contains arbitrary high orders of time derivatives. In a first approximation, this 
may be modeled by adding to a conventional quantum field equation like the 
Dirac equation or the Schrodinger equation, a term containing a second time 
derivative: 



ih—ib = —ihca • VU + — e — ^ vrwV' • (1) 

ot mc^ ot^ 

In order to make e dimensionless, we introduced a factor Ijmc^ in the term 
containing the last term. Here we assume that e is constant, i.e. does not depend 
on time or position. For e = 0, the above equation reduces to the usual Dirac 
equation. It is clear that the last term in (1) violates Lorentz covariance. 
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This modified Dirac equation can be derived from a Lagrangian: 

1 

+ -ihc (ib'^a ■ V\b — ■ OLth) — + e — (2) 

2 ' ^ mc^ 

= (V’7“'9aV' - 9aV’7“V') - mc^'ip'tp - e^^dt^-f^dtip ■ (3) 

This implies, in particular, that we have a conservation law 



Q = dtp + V ■ j 



(4) 



with the probability density 






p = - e — - ip^dtif) 



me 



(5) 



and a current 



j = hcip'^ ol4> ■ 



( 6 ) 



The coupling to the electromagnetic field can, as usual, be introduced by 
means of the minimal coupling procedure: 

C = ^ih + ie<f))'tp — {dt — iecj))tp'^ iIj) 

+ l-ihc • (^V — tp — (v + ■ ctV' 



he 



he 



h^ 



—mc^ j3ip + e — ^{dt — ie(j))'ip~^ {dt + ie(j))ilj . 



mc^ 



(7) 



The corresponding modified Dirac equation is 



le 



ihdtip = —iheoL • ( V — — A ) ip + fime'^ip + ecp+ e- — ^ {dt — ieep) ip . (8) 



3 Plane Wave Solutions and Neutrino Propagation 

It is not difficult to present an exact plane wave solution for (1). Inserting the 
ansatz ip = exp {i{Et — p ■ x)) a into (I) gives Ea = (a • pc + /3mc^ + a, 

or^ 

0 = (^ 7 ° + 7 >aC- a. (9) 

The corresponding dispersion relation reads 

(e ^E‘^\ —p^(?=m?‘c^ (10) 

V mc^ / 

Indices with a hat run from 1 to 3 and, for example, pa is represented by p. 



1 
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which possesses the four solutions 



= 



= 



mc^ — \J =F 4emc^Ayp^c^ + 



2e 



mc^ + \J ± 4emc^\/p'^c^ + w?c^ 



( 11 ) 



2e 



For small e this reduces to 



E^'^ = ±c\/p'^ + + e (rru? + — ^ 

\ m / 

^£ 2 ) ^ ^ ^ + w?c^ -e(mc^ + ^) . 

e \ m / 



(12) 



That means that for e — >■ 0 the solutions E^'^ reduce to the well-known solutions. 
The other two solutions are new and diverge for small e. However, this large 
quantity will drop out by considering energy differences. The ± in the solutions 
for the energy corresponds to positive/negative energies, the (1), (2) corresponds 
to the two solutions which come up with e yf 0. 

In the high energy limit m — >■ 0 we get (p = |p|) 



E^^> = ±cp ± 



2p 



+ € ( mc^ + — 
m 



( 2 ) mE 
E^ = ±cp± 



€ ^ 2p 

In the low energy limit p — >■ 0 we get instead 



I 2 , P 

— e me H 



m 



E^P = ±mc^ ± 



K 

2m 



I I 2 , P 

+ e 1 me H 

m 



E'f^ = 



me 



± m(? ± 



P 



,2 , P 



— e me + 



e 2m 

The exact expression for the group velocity reads 



{v^±r = ± 



(4"¥ = ± 



_|_ rn^c^sjl =F + m^c^ 

PoC 

+ m?c^ ^1 ± 4 ^ _|_ j7j2g2' 



(13) 



(14) 



(15) 



Therefore, for positive energies, particles with the same momentum but with 
different directions propagate with different velocities (the same is true for par- 
ticles with negative energies). This property can be used for a comparison with 
data from neutrino propagation. 
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For small e we get from (15) 

{v2Y = ± 

= ± 



for TO — >■ 0 



and for pa — >■ 0 



PaC 2e^ 

+ TO^C^ W 



PaC 



- 2e 



Pa 



a/p^ + TO^c2 W 



(4 ) = ± — CT ^^Pa + 2e— 



„(2)a 



P 

Pa 

P 



2p3 



9 

m e ^ Pa 



(4T = ±^cT^Pa-2e 



m 



(nW)“ = ±^+2e^ 

m m 

(^W)a ^ _ 2e^ , 

m m 



(16) 



(17) 



(18) 



For a comparison with data from the propagation of neutrinos, which may 
be produced in connection with GRBs, we use (17). We compare the arrival time 
of neutrinos with the arrival time of light over a distance of I = 10^° ly. If the 
neutrinos and the photons are produced during the same event, and if we take 
the mass of the neutrinos to be 1 eV and the momentum p = 10® GeV, then we 
get as difference of the time-of-arrival 

At = - - ^ ^ ±2e^] « (4x 10-9±6.4el0®i) sec. (19) 

c ^(F2) c V 8p4 me) ^ ^ ’ 



The first term can be neglected compared to the second one, so that we get 
\At\ = |e|6.4 X 10®^ sec. Asssuming a temporal structure of the source of about 
a millisecond [12] and assuming a null-result, then we can get from observations 
of the propagation of neutrinos and of photons the estimate 



jej < 1.6 X 10"®®. 



(20) 



Thus, neutrino observations in the future have the potentiality of high precision 
determination of the parameter e. Any jej larger than that given by (20) should 
be detectable by this means. 

In quantum gravity theories, jej is proportional to the ratio of the Planck 
length and some intermediate length, e = nlp/L where k is assumed to be of the 
order 1 [38]. If we take L = h/p, then, in terms of k, the above estimate means 
|«;| < 1.6 X 10“^^ which certainly is in contradiction to the assumption that k 
is of the order 1. From this we conclude, that, if the Dirac equation contains an 
additional quantum gravity induced term with the second time derivative of the 
neutrino field, then this term should be observable in the future by comparing 
neutrino propagation with photon propagation. 
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However, from the derivation of the modifications of the Dirac equation [38] , 
the parameter e or k may be constant over the scale L of the “weave” states only. 
Therefore, it may be possible that the effect, as we calculated it, may not occur 
due to an averaging to zero over larger distances. Consequently, it is necessary 
also to perform tests which take place on a small scale only. One kind of such 
tests is atomic spectroscopy what we are going to discuss below. 



4 The Non— relativistic Limit 



4.1 The Non relativistic Field Equation 

First we calculate the modified Pauli equation corresponding to eqn (1). In order 
to do so, we first subtract the rest energy from the wave function by means of 
the substitution 






tp = e 



( 21 ) 



resulting in an elimination of the rest mass in one part of the wave function. 
This gives 



(1 + 2e(l + e))ihdtip' = —ihcot • ( V — + (/? ~ + etppj' 






mc^ 



{dt + iecpYip' . 



(22) 



With the projection operators P± := ^(1 ± (3) we define the ‘large’ and ‘small’ 
parts of the wave function: = P±'tp'. Multiplication of (1) with and P_ 

gives the two equations 



(1 + 2e(l + e))ihdt4’+ = —ihca. • ( V — — A \ ^p'_ + e(j>'ip+ ~ e — y(5t + ie</>)^'i/’^ 



he 



(1 + 2e)ihdt'ip'_ = —ihca ■ ) ip'^ + etpip- — 2m<?%p'_ 

h^ 



- e — ^(5t + ie(j)Y'ip'_ . 



(23) 



(24) 



As usual, we assume that in the second equation the energies ihdtip'- and e(f>ip- 
are small compared with the rest mass term Therefore, we approximate 



1 



ip'_ « -- — iha • V - —A V’h 



2mc 



le 



he 



17 + . 



(25) 



Inserting this into the first equation (23) gives 



(1 + 2e(l + e))ihdtY+ = ^ 



le 



he 



h- 



— S-B-e—{dt + ie4>YY+, 



me 



me 



(26) 
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with S = 



(j 0 
0 (T 



and -B = V x A. Here cr are the three Pauli matrices. After 



division through 1 + 2e(l + e) we get 



ihdttp'+ = - 



2m* 






+ 



■e*(l)'ip'+ E B-e- 



' m*c '^m*c^ 



{dt + 



2 „;/ 

+ ’ 

(27) 



where we absorbed the e-factor in a redefinition of mass and charge: m* := 
(1 + 2e(l + e))m, e* = e/(l + 2e(l + e)). 

As our result we get a modified Schrodinger equation for a two-spinor 

^ ) V' + H cr • B - + iecffxp . 

ot 2m \ nc J me m c‘^ 

(28) 



This equation can also be derived from a Lagrangian 

^ - ie(j))'ip - {dt + ie(j))ij;*ii) (29) 

-2^ (^ + S"*) ^ - yA) * - ■ 

Equation (28) also possesses plane wave solutions whose energies are given 
by (14). We also have a conservation law 

^p + Vj = 0 (30) 

with the probability 
and the current 

itr 

J = ^ {^ri’ - ■ (32) 

Like in the Klein-Gordon equation there seems to exist the possibility to get 
negative probablities. However, using the Schrodinger equation, we get for the 
probability 



p = tp*tjj - e^— f-ifi^tp*ip + 



= tp*tp + e- 



mc^ 
1 



dt 



dt 



A'ijj^ + iP*^A^P 



2m 



2m* 



(33) 



This quantity is strictly positive if e < 0, and for e > 0 this is positive if |e| is 
small enough. 
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4.2 Modifications of the Energy Levels 



It is possible to calculate the energy levels of the hydrogen atom exactly. In 
order to do so we choose A = 0 and </> = </>(r) as the usual spherically symmetric 
electrostatic potential: 

• 4''’ = 04 - 4 



We asume a stationary solution, then ihdt'^p = Etp, and we get from the above 
Hamiltonian 






e* -2e ^E] 



m*c^ 



With Z\ = 4 gy gy ) ~ 4-^ > '''^here L is the angular momentum operator with 

the eigenvalue equation L Yj™ = 1(^1 + with I = 1,2,3, With a splitting 

of the wave function into a radial and an angular part ^|) = R{r)Yj^ {{) , ip) we get 
the radial part of the wave equation 



E^\R=- 



f I d 



2m* dr V dr 



R — E l)i? 



e* - 2e- 



,eU^R. 



With the explicit expression for the electrostatic potential 4> = —ejr we get by 
multiplication with 2m* 



ar^ r dr 



e* - 2e- 



-,E^ R 



iE)lR-^{Ki + i) + ^^)R^ (37) 



Since the r-dependence is the same as for the usual hydrogen atom, this equation 
can be solved using the standard scheme: With 



e* ^E], e-.= l(l+l) + 2ea^ 

m*c^ ) 



where a = j he is the fine structure constant, we get for the radial part of the 

wave function 

We introduce dimensionless coordinates by 



/ ^ pi / 

r = —, -£ = -r^, q = qro 

ro 4rg 
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and get with R'{r') := R{r{r')) 



dr'^ 



2 d 
r' dr' 



1 

4 



i?' = 0. 



We also introduce a new variable f{r') through 



(41) 



(42) 



with a parameter 7 which will be specified later. We get the following equation 
for the function /: 



0 = r 



dr'^ 



+ (27 + 2 — r) 



df{r') 

dr' 



/ 7(7+ 1) 
\ r' 



+ g' - 7 - 1 Z(r') . (43) 



In order to solve this equation we specify the value of 7 by the requirement that 
the term ~ 1/r' should vanish: 



7 (7 + 1) — = 0 . 



(44) 



This gives the two possibilities 



7± = -i±^/^=-i±^(z+i)' + 2ca2, 

and from (43) the differential equations 

zf'+{^±-z)f-(3±f = 0, 
with 

^?± = 27 ±+ 2 =l± 2 W£+i 

/3± = 7± + 1 - g' = i ± i - q . 



(45) 



(46) 

(47) 

(48) 



Eqn. (46) is the confluent hypergeometric differential equation with the solution 
[13] 



f{fi±,'d±,z) 



^ P±\{'d± + v)\ 



(49) 



which is appropriate for our problem. 

It is clear that, in order to get no infinite terms, •d± is not allowed to be 
a negative integer: 'di yf -1,-2 ,..., which is fulfilled if e yf 0 and jej < 1. 
For e = 0 we cannot use the solution -d_. In addition, if the sum in (49) does 
not terminate, then the solution diverges for large r faster than exp Qr') which 
leads to non-normalizable solutions. The condition for a termination of the sum 
is /3± G Z“, or 



!3± = -k, 



fc G N . 



(50) 
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With (48), (40), and (38) we get four energy eigenvalues E 

( 1 ) m*c2 - 2eaa* ^ m*(? Kl + 2eaa*{l - 

~ Kl - 2ea2 ~Ye Kl - 2ea^ 

(2) m* — 2eaa* m*c^ + 2eaa*(l — 2-pr) 

^ ~ Kl - 2ea2 2e - 2ta^ 



(51) 

(52) 



with K± = A: — ^ =F \J 5)^ + 2eo;^ and a* = e*^ jhc. To first order in e we get 



e''}'^ = m*c^ 



1 1 4 5 — 8/c + 2; 2\ \ frn\ 

e ^ T(l-fc + 0" ~ “ ^4(1 -A: + 0^(1 + 2/) + ^ 



e''}'^ = m*c^\- 
e 



E^^ = -m*c^ 



77(2) * 2 

£/_ = — m c 



1 



1 



2 (A: + 0^ 
^2 1 
2 {1-k + iy 

cr2 1 

2 (fc + /)2 



^ 3 -8k -21 

4{k + l)^{l + 2l) 

4 5 - 8A: + 2/ 

4(1 -A: + 0^(1 + 20 



0(e2) 



(54) 



+ 0(e2) (55) 



4 3 8k 21 ^ ^^^2^ 



4(A: + 0^(1 + 2/) 



(56) 



The solution is then given by 

R{r) = R{r{r')) = R'{r') = Ae~^"^ r''^f(r') = Ae~^^ f{-k,i!}±,r') (57) 



where + is a normalization constant and where all parameters depend on 1. 

For e yf 0 all these energy values are well defined. Even for very small e the 
first two energies (53,54) are valuable solutions, too, since only energy differences 
are observable and thus the first term drops out. However, there are two reasons 
which justify to drop the first two solutions: (i) Except the first term mc^/e, the 
upper two sets of energy levels (53,54) are the same as the lower two sets (55,56) 
up to sign. For the first two sets of energy levels the continuum is below the 
discrete spectrum. By postulating that all particles fall into the lowest energy 
level, then all atoms will fall into the continuous part of the spectrum which has 
never been observed. (One also can postulate that all particles want to go to the 
highest energy level. But due to the symmetry of the two sets of spectra, this 
will give the same answer. Therefore we do not consider the upper two sets by 
convention.) (ii) We want to describe small modifications of the known energy 
levels given for e = 0. Owing to these reasons, we keep the last two sets of energy 
levels. 

( 2 ) 

Defining for E_j_ ^ the principal quantum number n := 1 — k + I, then the 
energy levels are given by 



E^^ = —m*c^^ ( +7 



ea 



/ 3 4 

~ n3(l + 20 



0(e2) 



(58) 
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and if we define for E_ the principal quantum number n := k + 1, then the 
energy levels are 



t;i(2) » 2 

E_ = —m c 



3 



n3(l + 2Z) 



0{^) 



(59) 



where nu^a^ /2 is the Rydberg constant Ry. It is remarkable, that in both cases 
we get the same scheme of energy differences. Therefore, at this order we cannot 
distinguish between energies and 

There are two differences between this result and the usual spectrum of the 
hydrogen atom: (i) No degeneracy with respect to 1. (ii) Additional 1/n^- and 
l/n^-terms. From (58) we get for the Lyman-series with n = 1 and I = 0 as 
ground state, for example, the energy differences 

AELym.n = Ry (l ~ ^ + ca^ - 4 (^1 - ^3(i + 2o)) ^ ^^""0 ■ 

(60) 

The additional terms 1/n^ and n^{l + 21) modify the structure of this series. 
The ionization energy is given by n — >■ oo and is 

Aionization = RU ~ ^(^^)^ ■ (®1) 



From numbers related to AifLyman or Ajonization one can draw estimates on the 
value of e. 

Since the accuracy 5AE/AE for recent measurements, see for example [14], 
is of the order of which agrees completely with the conventional theory, 

we can conclude that the relative deviation from the usually calculated energy 
has to be smaller than this uncertainty: 



6AE 

AE 




< 10 "^^. 



(62) 



This means that 



e < 10-^3- 







13 

8 + 



7 X 10"^°. 



(63) 



In terms of a k as introduced after Eqn. (20) we have the estimate k < 7 x 10"^ 
which is outside the assumption that n is of the order unity. Therefore, the 
accuracy of atomic spectroscopy is still at least five orders of magnitude too 
small in order to be able to detect any influence of quantum gravity on atomic 
levels. 



5 Conclusion 

We have shown that an additional term in the Dirac equation containing a sec- 
ond time derivative, as it is motivated from quantum gravity, influences neutrino 
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propagation and atomic spectroscopy. While neutrino propagation may be ca- 
pable to ‘see’ this additional term, the accuracy of spectroscopy has still to be 
improved by five orders of magnitude in order to be sensitive to this term. 

A clear difficulty with the hydrogen spectrum is that, precisely, it cannot be 
calculated exactly with an arbitrary accuracy with the present state-of-the-art. 
There is a number of corrections to the Dirac solutions: recoil, QED, finite nu- 
cleus size, see [16] for a review. These corrections scale with \jv? and amount 
to a few kilohertz. They are very difficult to calculate and there are still some 
discrepancies between theoretical results. Fortunately, some combinations of fre- 
quencies are independent of these corrections at their leading order and can be 
used for higher accuracies. Concerning the measurements themselves, there is 
presently a very rapid evolution towards much higher accuracies. For example, 
it was shown recently that, because of the extreme regularity of the frequency 
comb generated by femtosecond lasers over a very wide spectrum [17], they 
could be used to compare frequencies of oscillators which differ by several or- 
ders of magnitude. This provides a way to compare many transition frequencies 
of the hydrogen atom between themselves and with microwave clocks, with the 
potential accuracy of the cesium fountain clock which is presently 10“^® and 
should improve quickly by another order of magnitude. Also, the techniques to 
interrogate narrow transitions of cold atom hydrogen by sub-Doppler methods 
or atom interferometry have improved very significantly either in cold thermal 
beams (Hansch and coworkers in Garching, Biraben and coworkers in Paris) or 
in clouds generated from Bose-Einstein condensates (Kleppner and coworkers at 
MIT). A subkilohertz linewidth is presently achieved for the 1S-2S two-photon 
transition [17] and could still be reduced by one or two orders of magnitude in 
the near future. The hydrogen atom is thus potentially a universal clock by itself 
covering the full spectrum from UV to microwaves (hydrogen maser) . 
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Abstract. The coupling of the electromagnetic field to gravity is an age-old problem. 
Presently, there is a resurgence of interest in it, mainly for two reasons: (i) Experimental 
investigations are under way with ever increasing precision, be it in the laboratory or by 
observing outer space, (ii) One desires to test out alternatives to Einstein’s gravitational 
theory, in particular those of a gauge-theoretical nature, like Einstein-Cartan theory 
or metric-affine gravity. — A clean discussion requires a reflection on the foundations 
of electrodynamics. If one bases electrodynamics on the conservation laws of electric 
charge and magnetic flux, one finds Maxwell’s equations expressed in terms of the 
excitation H — {T>, H) and the field strength F = {E, B) without any intervention of 
the metric or the linear connection of spacetime. In other words, there is still no coupling 
to gravity. Only the constitutive law H = functional(E') mediates such a coupling. We 
discuss the different ways of how metric, nonmetricity, torsion, and curvature can come 
into play here. Along the way, we touch on non-local laws (Mashhoon), non-linear ones 
(Born-Infeld, Heisenberg- Euler, Plebanski), linear ones, including the Abelian axion 
(Ni), and find a method for deriving the metric from linear electrodynamics (Toupin, 
Schonberg). Finally, we discuss possible non-minimal coupling schemes. 



1 Introduction 

General relativity was proposed in 1915. One of its predictions was the bending 
of light rays of stars in the gravitational field of the Sun. This effect was verified 
observationally soon afterwards by Dyson et al. in 1920 and put, as a result, 
Einstein’s theory in the forefront of gravitational research. 

Within the framework of general relativity, a light ray can be extracted from 
classical electrodynamics in its geometrical optics limit, i.e., for wavelengths 
much smaller than the local curvature radius of space. Accordingly, the bending 
of light can be understood as a result of a nontrivial refractive index of spacetime, 
see Skrotskii et al. [81,89], due to the coupling of the electromagnetic field F to 
the gravitational field g. Classically, we have in nature just these two fundamen- 
tal fields F and g, the weak and the strong fields being confined to microphysical 
dimensions of 10“^®m or 10“^®m, respectively. Therefore, the coupling of F and 
g is of foremost importance in classical physics. 
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The conventional way that coupling is achieved is to display the Maxwell- 
Lorentz equations of vacuum electrodynamics in the (flat) Minkowski world of 
special relativity theory in Cartesian coordinates. For this purpose, usually the 
formalism of tensor analysis (Ricci calculus) is used, see [78]: 

,j = F , F,,, fc + + Fkij = 0 . (1) 

Here Fij = -Fji = {Fqi, Fq 2 , Fq^, F 23 , F 31 , F 12 ) = {E,B) is the electromagnetic 
held strength, P the electric 4-vector current, and 

F^^ (2) 

with g*-' as the contravariant components of the metric. The commas in (1) 
denote partial differentiation with respect to the local spacetime coordinates a;*. 

If we switch on gravity, the flat Minkowski world becomes curved, the space- 
time geometry now being a Riemannian geometry with a variable metric gtj (x) 

of Minkowskian signature (-1 ). The coupling of the Maxwell-Lorentz set 

(1) to gravity is now brought about by the comma goes to semicolon rule , — >■ ; 
(see [52]), where the semicolon represents the covariant derivative Vi = -i with 
respect to the Riemannian connection ( “Levi-Civita connection”): 

F^^ = r , F,j,u + F,k■,^ + Fu-,j = Q. ( 3 ) 

This translation rule from special to general relativity is also alluded to as min- 
imal coupling with the additional understanding that the components of the 
metric in (2) become spacetime dependent fields. 

The metric held gij{x), entering ( 3 ) via (2) and via the covariant derivatives, 
i.e., via the semicolons, has to fulfill the Einstein held equation, 

Ricij - - gij Rick = k T ^ + T j , (4) 



with 

:= , Ti ^ := t (- F,kF^>^ + ^ S{ FuF>^^) . (5) 

V Mo V 4 / 

Here Riji is the curvature and T ij the material energy-momentum tensor. The 
coupled Einstein-Maxwell system describes correctly a wealth of experiments, in 
particular the gravitational bending of light, the gravitational redshift, the time 
delay of radar pulses in the gravitational held of the Sun, and the gravitational 
lensing and microlensing of starlight in the gravitational held of galaxies. 

But in all these experiments, we study the propagation of light along null- 
geodesics in a prescribed (and perhaps slowly varying) gravitational held which is 
a solution of the Einstein vacuum equation - and not of the eZectro- vacuum equa- 
tion. We could call this the non self-consistent Einstein-Maxwell theory. In the 
solar system, e.g., the Schwarzschild metric is taken as solution of the Einstein 
vacuum equation and the motion of a “photon” is described by the null geodesic 
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equation on this background. A true novel effect of the Einstein-Maxwell theory 
would be, e.g., the generation of electromagnetic waves by gravitational waves. 
Because of their smallness, no such effects were ever observed. Accordingly, the 
interaction of a classical electromagnetic field Fij in the form of a lightray with 
a prescribed gravitational field gij{x) is well described by means of Eqs.(3). 
Nevertheless, further consequences of these equations need to be compared with 
experiment as soon as more sensitive measuring methods are available. 

2 On the Equivalence Principle 

According to Einstein’s equivalence principle, see [20], gravity can be locally 
simulated in a gravity-free region of spacetime by going over from the Cartesian 
coordinates, anchored in an inertial frame of reference (including an intertial 
clock) and used in (1), to arbitrary curvilinear coordinates yielding a non-inertial 
frame in general, as in (3). In this context, the metric (/ij, occurring in (2) and in 
the semicolons of (3), is understood as a fiat metric in curvilinear coordinates. 
Thus, the minimal coupling can be interpreted, in a first step, just as a coordi- 
nate transformation from Cartesian to curvilinear coordinates. And, moreover, 
it identifies the metric as the gravitational potential. 

On the other hand, let us assume that we are in a region with gravity and 
(2) and (3) are valid together with the Einstein equation for the metric. Then, 
also according to Einstein’s equivalence principle, we must be able to pick suit- 
able coordinates such that locally the equations look like in special relativity 
in Cartesian coordinates. In Riemannian geometry, the local coordinates are 
called Riemannian normal (hence geodesic) coordinates at one point P, if the 
Christoffel symbols 

^ij 2 ^ ( 5*0 9jhi ~ (®) 

vanish at P and and the metric becomes Minkowskian: 

r./|p = 0, 5y|p = diag (+1,-1, -1,-1). (7) 

Accordingly, the semicolon becomes a comma and the metric in (2), at one given 
point, looks fiat. 

Still, the curvature is non- vanishing, of course: Rijk’’\p yf 0. The equations 
look fiat since they contain only first derivatives. If they contained second deriva- 
tives, then the semicolons goes to comma rule and its reverse would not work 
since on that level not only the Christoffels enter but potentially also the curva- 
ture which, in contrast to the Christoffels, is a tensor and cannot be nullified by 
means of a suitable choice of coordinates. For that reason, the minimal coupling 
procedure, being in this context an expression of the equivalence principle, must 
be applied only to first order differential equations. The safest thing is then to 
apply it, as a rule, only on the level of a Lagrangian, since there ordinarily only 
first-order expressions are allowed for. Non-minimal couplings of the gravita- 
tional field to electromagnetism have also been investigated, see Prasanna [74], 
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Buchdahl [8], Goenner [23], and Miiller-Hoissen [58-60], for example, or for light 
rays in non-minimally coupled theories, see Drummond and Hathrell [17], but 
the price one has to pay is to introduce a new constant of nature; and there is 
no evidence for such a constant in nature - unless one takes the Planck length 
itself. We will come back to these questions in Sec. 7. 

Therefore we can conclude that the equivalence principle and minimal cou- 
pling work well for the Maxwell-Lorentz equations (1) and that they lead to 
experimentally established equations. 

Wave Equation for the Electromagnetic Field Strength. We hasten to 
add that, within the framework of the minimally coupled Maxwell-Lorentz equations, 
we find 2nd derivatives if we derive the wave equation for the electromagnetic field 
strength F — and then also curvature terms are expected to emerge. This is exactly 
what happens, as already found by Gordon [24] and Eddington [18]. 

In the framework of exterior calculus (see Frankel [21]), let us consider the elec- 
tromagnetic field strength 2-form F — ^ Fij dx* A dx^ . In Maxwell-Lorentz vacuum 
electrodynamics, it satisfies dF = 0, eod *F = ^ J. We denote the codifferential by 
5 := *d*. Then we find, with the wave operator (d’Alembertian) 

\Z\ '■= S d + d 5 , (8) 

and by using the Maxwell-Lorentz equations, the wave equation 

□F = — d *J , (9) 

SoC 

see [53]. The left hand side of this equation, in terms of components, can be determined 
by substituting (8): 

□ F = ^ Fij + 2Ric[i'° Fj]k - R^Uj Fm^ dx* A dx^ . (10) 

Accordingly, minimal coupling can lead to curvature terms of a prescribed form.^ 



3 A Caveat 

Soon after general relativity had been proposed, it became clear, see Einstein 
[19], that one can introduce as auxiliary variables the densities 

:= g^\x) g^\x) Fki , ( 11 ) 

with g(x) := det gij{x), in terms of which the Maxwell-Lorentz equations (1) can 
be rewritten in a metric-free way as 

+ F,k,i + = 0 . (12) 

Of course, we could have non-minimal coupling as, e.g., in nF-l-y (ectJe/ 3 jF“^) AF = 
-F d *J , see Sec. 7. 

Sgc ’ 



1 
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Similarly, the charge conservation law P-i = 0 can be put in the form 

l\i = 0. (13) 

The metric enters only via the densities defined in (11). In fact, if we started from 
the set (12) in special relativity right away, then no comma goes to semicolon 
rule would have been necessary: These equations are generally covariant, they 
are valid in arbitrary curvilinear coordinates, be it in the framework of special 
or general relativity theory. 

In the calculus of exterior differential forms (Cartan calculus), see Frankel 
[21], these equations can be formulated very succinctly. We introduce the electric 
current as odd 3-form, 

J := p — j f\dt = ^ Jijk dx* A dx^ A dx* , (14) 

the electromagnetic excitation as odd 2-form 

H = V-nAdt= ^Hijdx^ Adxff (15) 

and the electromagnetic field strength as even 2-form 

F = B + E Adt= ^Fijdx^ AdxP (16) 

Then (12) reads 

dH=J, dF = 0, (17) 

with 

dJ = 0. (18) 

The set (17) represents the Maxwell equations. They are independent of metric 
and connection. The constitutive relation for the vacuum reads 

H= f^*F, (19) 

V Mo 

where the star * represents the metric-dependent and odd Hodge duality oper- 
ator. Eq.(19) corresponds to (ll)i and I* can be related to the components of 

d, 

, r = 4 , (20) 

Z! V ^0 3! 

with ed** = ±1,0, the totally antisymmetric Levi-Civita tensor density. 

Now the equivalence principle looks empty: Since the Maxwell equations (17) 
are formulated in a coordinate and frame independent way, they are valid in this 
form in arbitrary coordinate systems and frames, be it in a fiat or in a curved 
spacetime. Only the constitutive relation (19) “feels”, up to a conformal factor, 
the presence of a fiat or a non-fiat metric, i.e., the constitutive relation couples to 
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the conformally invariant part of the metric. The coupling of electromagnetism 
to gravity becomes almost trivial. Is all this just a mathematical trick, which 
distracts from the physical content of Maxwell’s theory, or is it more? 

One further observation hints also at the need for clarification. The Einstein- 
Cartan theory of gravity is a viable gravitational theory, see [30,51,86]. It is the 
simplest model of the metric-affine gauge theory of gravity, see [31,25]. In the 
Einstein-Cartan theory, spacetime is described by means of a Riemann-Cartan 
geometry with torsion and curvature.^ If we couple (3) to gravity, do we have to 
use the semicolons as covariant derivatives with respect to the Riemann-Cartan 
connection or still with respect to the Christoffels, see [14]? In the context of 
(17), this question cannot even be posed, since the exterior derivative d is all 
what is needed. Are then the equations (17) misleading as a starting point for 
coupling to gauge gravity? What could be the appropriate starting point? 

Provided one formulates Maxwell’s theory and its coupling to gravity in 
terms of a Lagrangian with the electromagnetic potential A as variable, gauge 
covariance of the formalism results in (17) cum (19), as was pointed out by Benn, 
Dereli, and Tucker [5]. However, we would like to have some more immediate 
insight into the structure of electromagnetism as induced by experiment even 
without having a variational formulation at our disposal. 

4 Electric Charge and Magnetic Flux Conservation 

The metric is a quantity which allows to define lengths and angles in spacetime. 
There are, however, laws in physics which don’t require the knowledge of a 
metric. Take the conservation law of electric charge as an example. Mark a 3- 
dimensional simply connected submanifold f? 3 . We know from experiment that a 
possible electric charge inside 173 is composed of charge “quanta”, i.e., there is an 
integer number of elementary charges in f2^. Recent advances in technology made 
it possible, see [13,44], to trap and to count single electrons and protons. Thus, 
as soon as we have such quanta available, we can rely on counting procedures, 
see Post [73] , the use of a meter stick or a chronometer is superfluous under such 
circumstances. 

Electric charge conservations is experimentally well-established and is one of 
the pillars electromagnetism rests on. We formulate it, following Kottler-Cartan- 
van Dantzig, see [72,87] and also [32], most appropriately as an integral law. 
According to (14), we assume the existence of the odd electric current 3-form J. 
We take charge conservation as axiom 1, that is, J integrated over a closed 3- 
dimensional hypersurface f ?3 has to vanish, if this hypersurface is the boundary 
of a connected 4- volume 174 : 

j J = j dJ = 0, or dJ = 0. (21) 

df24 i?4 

^ A proper discussion of the equivalence principle in the context of Einstein-Cartan the- 
ory requires the introduction of local coframes, see [25,27,38]. Being concerned here 
only with electromagnetism, it is sufficient to use natural, i.e., holonomic coframes. 




How Does the Electromagnetic Field Couple to Gravity? 485 



Here we applied the Stokes theorem. 

If (21) is assumed to be valid J = 0 for all three-cycles dCs = 0, C 3 yf 
dJ74, then, according to a theorem of de Rham, J is exact, see [73]. Thus the 
inhomogeneous Maxwell equation is a consequence, 

J = dH, (22) 

with the odd 2-form H of the electromagnetic excitation, see (15). The excitation 
is only determined up to an exact form. Nevertheless, the electric excitation T> 
can be measured by means of Maxwellian double plates as charge per unit area, 
the magnetic excitation % by means of a small test coil, which compensates 
the "H-field to be measured, as current per unit length. This is possible since in 
theses null experiments vanishing field strength F implies vanishing excitation 
FI, see [32]. In other words, the extensive quantities T> and % - and thus the 
4-dimensional excitation iJ - have an operationally significance of their own, 
since they are related to charge at rest or in motion, respectively. Accordingly, 
the somewhat formalistic introduction of the densities in ( 11 ) has now been 
legitimized as a transition to operationally meaningful additive quantities. Note 
that up to now only the differential structure of the spacetime was needed, a 
metric has not been involved. 

Let us choose a field of 4-frames Cq, and consider the motion of a point particle 
with respect to the reference frame thus defined. As axiom 2 one can take an 
operational definition of the electromagnetic field strength F via the Lorentz 
force density 

/a = (eajF)AJ. (23) 

The interior product (contraction) is denoted by J . The force density fa is a 
notion from classical mechanics. It is an odd covector- valued 4- form. Accordingly, 
Eq.(23) can be read as a definition of the even 2-form F, see (16). Again, we 
don’t need a metric. And we know the recipe of how to proceed in the same 
manner. 

That F can be interpreted as magnetic flux is obvious if we choose i ?2 as 
a ‘spacelike’ surface (strictly, at this point we don’t know what spacelike means; 
we will come back to this later) . In superconductors under suitable circumstances 
we can count (in an Abrikosov flux line lattice) quantized magnetic flux lines. 
This suggests that magnetic flux is a conserved quantity (axiom 3): 

J F = j dF = 0 or dF = 0. (24) 

In this way, by means of the axioms (21), (23), and (24), we recovered the 
fundamental structure of Maxwell’s theory: dH = J, dF = 0 . This is what had 
been called metric-free electrodynamics? What is missing so far is the relation 
between the excitation FI and the field strength F, and it is exactly there where 
the metric, i.e., the gravitational potential comes in. 

® Stachel [83] calls it generalized electrodynamics. We don’t follow this suggestion, 
since Maxwell’s equations were originally given in terms of {T>, H) and {E, yM) in 
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5 No Interaction of Charge 

and Flux “Substrata” with Gravity 

We now understand that the inhomogeneous Maxwell equation dH = J, as 
an expression of electric charge conservation, cannot be influenced by gravity, 
i.e., by the metric tensor g, or, in the case of metric-affine gravity or its specific 
subcases, such as Einstein-Cartan theory, by the connection F of spacetime. The 
electric charge substratum of spacetime has rules of its own. Spacetime can be 
deformed by the presence of metric and connection, but the charge substratum 
and the net electric charge to be attributed to a prescribed 3-dimensional (3D) 
volume won’t change. Thus the additivity 3D volume- wise of the charge lays 
at the foundation of the Maxwellian framework. And it translates into the 2D 
additivity of the integrated excitation iJ - this being the reason why one 
uses this integral for the operational interpretation of FI . 

Similar arguments can be advanced for the homogeneous Maxwell equation 
dF = 0. However, first of all it should be stressed that the axiomatics we are 
using strongly suggests the non-existence of magnetic charges. If there were 
magnetic charges, then we would have no reason to believe in electric charge 
conservation either; compare for this argument axiom 1, Eq.(21), with axiom 3, 
Eq.(24). Conventionally, the inhomogeneous equation dH = J is seen in analogy 
to the homogeneous one dF = 0. But not so in the framework of our axiomatics 
which has a firm empirical basis. We put dJ = 0 in analogy to dF = 0. The whole 
historical development of electromagnetism, starting with 0rsted and Ampere, 
points to the elimination of the phenomenologically introduced magnetic charges. 
Most recent experiments, see [1,29], exclude magnetic charges with very good 
precision. Thus theoretical as well as experimental evidence speak against the 
existence of magnetic charges. 

Having said this, we hasten to add that, nevertheless, there is some kind 
of magnetic substratum in spacetime, namely the magnetic flux F . It is a 

substratum of its own right. The fluxoids, the quantized magnetic flux lines in 
superconductors, see [84], do convey a clear message. Besides electric charge^, 
magnetic flux® (and not magnetic charge) has an independent standing in elec- 
tromagnetism, too. Thus rightfully, it is governed by an own axiom, namely 
axiom 3. 

Axiom 3 is again a conservation theorem. In contrast to axiom 1, which has 
a fermionic smell, axiom 3 is more of a bosonic nature. Moreover, magnetic flux 
adds up 2D area-wise. For this reason, magnetic flux is represented by a 2-form 
and not, like the charge, by a 3-form. Accordingly, there are essential differ- 
ences between these two conservation laws which express the peculiarities of the 

a form ‘isomorphic’ to the (1 -I- 3)-decomposition of dH = J and dF = 0, see [50]. 
Therefore, the “generalized” Maxwell equations, dH — J and dF = 0, correspond 
in actual fact, just to Maxwell’s equations (modulo the substitution B — >■ gH). And 
this is how we will name them. 

^ STunit Coulomb, elementary charge e = 1.60217733 x 10~^® C. 

® Sl-unit Weber, elementary fluxoid fe/(2e) = 2.06783461 x 10”^® Wb. 
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electromagnetic phenomena. Electric and magnetic effects enter the Maxwellian 
framework in an asymmetric way, in spite of all that talk about a duality between 
electricity and magnetism. But there is also a similarity in that both axioms are 
formulated as integral conservation laws. The possibility to count the ffuxoids 
assures us that axiom 3 has to be again a law free of metric and connection. 

Incidentally, there is a nice visualization of the fundamental quantities en- 
tering electrodynamics. If one describes the quantum Hall effect for low lying 
Landau levels, then the concept of a composite fermion is very helpful: it consists 
of one electron and an even number of ffuxoids is attached to it, see Jain [41,42] 
and [43]. Isn’t that a very clear indication of what the fundamental quantities 
are in electrodynamics? Namely, electric charge (see axiom 1) and magnetic flux 
(see axiom 3), see also Nambu [61] in this general context. 

Our conclusion is then that, as long as we opt for electric charge and mag- 
netic flux conservation, the Maxwell equations in gravity-free regions, i.e., in 
the Minkowski spacetime of special relativity, read dH = J and dF = 0; they 
remain the same irrespective of the switching on of gravity, be it in Einstein’s 
theory, in metric-affine gravity (see [76]), or in any other geometrical theory of 
gravity. 



6 Constitutive Law of Electrodynamics 
and Its Relation to Gravity 

After having discussed extensively that gravity does not influence the Max- 
well equations, we eventually turn to the constitutive law via which gravity 
does influence electrodynamics. It is true, the charge substratum and the flux 
substratum themselves do not couple to gravity, as we have shown in the last 
section. However, the interrelationship between both substrata is affected by 
gravity. Metaphorically speaking, the “flow” of each of the substrata is ruled by 
a particular gravity-independent conservation theorem, but the flows of electric 
charge and magnetic flux are coupled via a gravity-dependent constitutive law 
since, in the end, magnetism has to be expressed in terms of electricity. 

Let us choose arbitrary local spacetime coordinates xL Then we have, 

iL = ^ Hij dx* A dx\ ^ ~ \ ^ ' 

We will turn first to the electrodynamics of material media in order to develop 
some intuition on the concept involved, but eventually, it will be the vacuum, be 
it in inertial or non-inertial frames, which will occupy the center of our interest. 

6.1 Non-local 

Moving macroscopic matter defines a (1 -I- 3)-splitting of spacetime specified by 
a well defined average 4-vector velocity field u which describes the congruence 
of worldlines of the flow of the medium. Such a vector field can be defined 
operationally from the motion of matter as follows. 
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Let a 3-dimensional arithmetic space be equipped with the coordinates 
a = 1, 2, 3. We will use these coordinates (known as Lagrange coordinates in 
continuum mechanics) as labels which enumerate elements of a material medium. 
A smooth mapping X(o) '■ — >■ X 4 into the spacetime defines a 3-dimensional 

space domain (hypersurface) V which represents the initial distribution of mat- 
ter. In local spacetime coordinates, this mapping (or labeling) is given by the four 
functions a^(n-)(C^)- It should be preserved at any time, i.e. along any worldline 
of a particular element its labels are constant. 

Given the initial configuration V of matter, we parameterize dynamics of 
the medium by the “time” coordinate r which is defined as the proper time 
measured along an element’s worldline from the original hypersurface V. The 
resulting local coordinates (t, ^“) are usually called the normalized comoving 
coordinates. Thus finally, the motion of matter is described by the functions 
cc*(r, ^“). Subsequently, we define the 4- velocity vector field by 



u := dr 



/c^\ 

V const 



(26) 



Evidently, a family of observers comoving with the matter is characterized by 
the same timelike congruence a;*(r, ^“). They are making physical (in particular, 
electrodynamical) measurements in their local reference frames which drift with 
the material motion. 

One says that a medium, moving in general, has dispersion properties when 
the electromagnetic fields produce non-instantaneous polarization and magne- 
tization effects. The most general linear constitutive law is then given, in the 
comoving system, by means of the integral 



H^AT,0=\ I dT'K,j>^^{T,T')Fu{r',0- 



(27) 



The coefficients of the kernel are called the response functions. We 

expect the metric to be involved in their set-up. Their form is defined by the 
internal properties of matter and by the motion of a medium. 

Mashhoon [49] has proposed a physically very interesting example of such a 
non-local electrodynamics in which non-locality comes as a direct consequence of 
the non-inertial dynamics of observers. In this case, instead of (25), one should 
use the field expansions 

H (28) 

with respect to the coframe of a non-inertial observer = ej“ dx^ . The consti- 
tutive law is then replaced by 

Ho.p{T,0 =IJ dr'K^ 0 ^\T,T') F^s{r',0 , (29) 



and the response kernel in (29) is now defined by the acceleration and rotation 
of the observer’s reference system. It is a constitutive law for the vacuum as 
viewed from a non-inertial frame of reference. 
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Mashhoon imposes an addititional assumption that the kernel is of convolu- 
tion type, i.e., Kap'^^{T,T') = Kap'^^ir — t'). Then the kernel can be uniquely 
determined by means of the Volterra technique, and often it is possible to use 
the Laplace transformation in order to simplify the computations. Unfortunately, 
although Mashhoon’s kernel is always calculable in principle, in actual practice 
one normally cannot obtain K explicitly in terms of the observer’s acceleration 
and rotation. 

Preserving the main ideas of Mashhoon’s approach, one can abandon the 
convolution condition. Then the general form of the kernel can be worked out 
explicitly {u is the observer’s 4- velocity): 

Ka(3^\T,T') = 5 {t - t') - wJ (t')) . (30) 

The influence of non-inertiality is manifest in the presence of the connection 
1-form. The kernel (30) coincides with the original Mashhoon kernel in the case 
of constant acceleration and rotation, but in general the two kernels are differ- 
ent [55]. Perhaps, only the direct observations would establish the true form of 
the non-local constitutive law. However, such a non-local effect has not been 
confirmed experimentally as yet. 



6.2 Non-linear 



But the constitutive law can also be non-linear (or non-local and non-linear at 
the same time). In the local and non-linear Born-Inf eld electrodynamics [6], with 
the dimensionfull parameter /e as maximal attainable electric held strength, we 
have , 



dVBi 9^-det\gki + j;FM\ 
dF ^ dF 



(31) 



The metric as symmetric second rank tensor enters here in a very natural way. It 
adds up with the antisymmetric electromagnetic held to an asymmetric tensor 
- much in the way Einstein had hoped to And for his unified held theories of 
gravity and electromagnetism. By differentiation, we And 



H = 




*F+ ^F)F 

- Ji*{F ^^F) - ^F)Y' 



(32) 



now the metric being absorbed in the (odd) Hodge star operator, see [21]. For 
/e — >■ oo, we recover the conventional local and linear Maxwell-Lorentz theory 
for vacuum with H = -\J^*F. The Born-Infeld electrodynamics is presently 

used as a toy model in string theories, see [22]. The problem with Born-Infeld 
electrodynamics is that, in contrast to Maxwell’s theory, it defies quantization. 
It is an interesting model, but nothing like an established theory. 

A similar example is the non-linear Heisenberg- Euler electrodynamics [34]. 
Quantum electrodynamical vacuum fluctuations yield corrections to Maxwell’s 
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theory that can be accounted for by an effective constitutive law constructed 
by Heisenberg and Euler. To the first order in the fine structure constant Of = 
it is given by (see also [39,35]) 



H = 







14 Off 

4 ^ 



{FAF)F 



(33) 



2 2 Q 

where Bk = « 4.4 x 10^ T, with the mass of the electron m. The met- 

ric is again hidden in the Hodge star and the Maxwell-Lorentz limit results 
analogously for m — >■ oo. The Casimir force between two uncharged electrically 
conducting plates, also an effect of vacuum fluctuations, has been experimen- 
tally verified as have been non-linear effects in the “superposition” of strong 
laser beams. Accordingly, the non-linear constitutive law (33) is a valid post- 
classical approximation of vacuum electrodynamics and as such experimentally 
confirmed. 

Note that these variants of classical electrodynamics respect charge and flux 
conservation. This underlines the fact that our axiomatics clearly points to that 
structure of electrodynamics, namely the constitutive law, which can be changed 
without giving up the essentials of electrodynamics. 

Both, Eqs.(32) and (33) are special cases of Plebahski’s more general non- 
linear electrodynamics [71]. Let the quadratic invariants of the electromagnetic 
field strength be denoted by 

/i := i*(E A*E) = i(£;2 - B^) and I2 :=]^*{F A F) = E ■ B , (34) 

where I\ is an even and I2 is an odd scalar (the Hodge operator is odd). Then 
Plebahski postulated a non-linear electrodynamics with the constitutive law® 



H = U{h,l2YF + V{h,l2)F, 



(35) 



where U and V are functions of the two invariants. Note that in the Born-Infeld 
case U and V depend on both invariants whereas in the Heisenberg-Euler case 
we have 17he(7i) and Ere ( 72)- Nevertheless, in both cases U is required as well 
as V. And in both cases, see (32) and (33), U is an even function and V and 
odd one such as to preserve parity invariance. 

If one chose V to be an even function, e.g., then parity violating terms would 
emerge. Such terms were most recently discussed by Majumdar, Mukhopad- 
hyaya, and SenGupta [48,56]; for the experimental situation (there seem no 
signatures for parity violations) compare Lue et al. [47]. 



Singularity-Free Electro-Gravitodynamics. Recently, Ayon-Beato 
& Garcia [2], for earlier work see Shikin [79], have proposed a constitutive law 

H = U{h)*F, (36) 

® Strictly, Plebahski assumed a Lagrangian which yields (17) together with the struc- 
tural relations F = u{Ii, I 2 )* H -j- v{li, I 2 ) H . The latter law, apart from singular 
cases, is equivalent to (35). 
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which, as subcases, does neither encompass (32) nor (33) and thus makes it 
appear as rather academic. The explicit form of U is defined by the requirement 
of obtaining completely singularity-free solutions of the coupled system of the 
gravitational field (Einstein) and the electromagnetic field (non-linear Maxwell). 
Examples of suitable functions U{Ii) are given in [79,2]. 

In terms of the local time and space coordinates (t, r, 6, (j>), the general spher- 
ically symmetric ansatz for the coframe can be written as 

= /(^) dt , = -rrr dr , {P' = r d6 , = r sin 9 d(j) , (37) 

f(r) 

whereas, for the electromagnetic field, we have 



F = ip{r) tl® A . 



(38) 



The exact solution of the coupled system of gravitational and electromagnetic 
field equations, i.e., of Einstein’s equation (4) and Maxwell’s equations dF = 
0, dH = 0, reads 



= g f2 ^ 1 _?!!(: _L 

^ C/(/i)r2’ ^ ^ ^2 ’ 

where q,m are integration constants and (‘Tolman’s integral’) 



(39) 



Q{r) = nr 




h 

IC = 2IiU{Ii)- J dl[U{l[). 



(40) 



In the last function one should substitute the explicit form of the quadratic 
invariant Ii computed on the spherically symmetric configuration (37) with (38). 

It is shown in [79,2-4] that the constitutive function U{Ii) can be chosen 
in such a way that the functions in (39) describe a completely regular, i.e., 
singularity- free configuration. 



6.3 Linear: Abelian Axion, inter Alia 

A very important case is that of a linear constitutive law between the compo- 
nents of the two-forms FI and F. It postulates the existence of the 6 x 6 = 36 
constitutive functions such that 

^ Efcp (41) 

This kind of an ansatz we know from the physics of anisotropic crystals. The 
factor 1/2 is chosen in order to have a smooth transition to the conventional 
D = SosE etc. relations, cf. [72] p.l27. 

The choice of the local coordinates is clearly unimportant. In a different coor- 
dinate system the linear constitutive law preserves its form due to the tensorial 
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transformation properties of Alternatively, instead of the local coordinates, 
one may choose an anholonomic frame and may then decompose the two-forms 
H and F with respect to it. 

Since H is an odd and F an even form, the constitutive functions 
are odd. Taking the Levi-Civita symbol, we can split off the odd piece according 
to 



^2-? • ^ ^2 



'ikl ^ Jjmn kl 

A — 2 ^ ^mn 



(42) 



Because of the corresponding properties of the Levi-Civita symbol, the are 
even scalar densities of weight -1-1. For the Levi-Civita symbols with upper and 
lower indices, we have tmnpq = 

With the linear constitutive law (as with more general laws), we can set 
up a Lagrangian 4-form; here we call it Idin- Because of iL = — dVun/dF, the 
Lagrangian must be quadratic in F. Thus we find 

Viin = — t:HAF= — ^ HijFpq dx^ A dx^ A dx^ A dx'^ 

2 8 

= - ^ PkiFpq dx° A dx^ A dx^ A dx^ . (43) 

The components of the field strength F enter in a symmetric way. Therefore, 
without loss of generality, we can impose the symmetry condition 
on the constitutive functions reducing them to 21 independent functions at this 
stage. 

The Kij^^ carry the dimension [k] = [y] = e^/fi. Therefore, still before intro- 
ducing the metric, we can split off the totally antisymmetric part of and 
define the dimensionless constitutive functions according to 



^ijki ^ j ° yfci ^ ^ijki ^ ° [ijki] ^ Q ^ (44) 



Here [/] = [a] = h/e^, and / = f{t,x) and a = a{t,x) represent one scalar and 
one pseudo-scalar constitutive function, respectively. Thus the linearity ansatz 
eventually reads 



H,, = - e,jmn Fm = Fe^jran X Fm + a F,, , 



with 



^mnkl ^nmkl ^mnlk ^klmn cUld ^[mnkl] q 



(45) 

(46) 



O , . 

i.e., besides a, we have 20 independent constitutive functions. Thus y has 
the same algebraic symmetries and the same number of independent components 
as a curvature tensor in a Riemannian spacetime. 

Pseudo-scalars are also called axial scalars. So far, our axial scalar a{x) is 
some kind of permittivity/permeability field. If one adds a kinetic term of the a- 
field to the electromagnetic Lagrangian (43), then a{x) becomes propagating and 
one can call it legitimately an Abelian^ axion. Ni [62] was the first to introduce 

^ In contrast to the axions related to non-Abelian gauge theories, see [91,92,54] and 
the reviews in [45] and [80]. 
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such an axion field a in the context of the coupling of electromagnetism to 
gravity, see also deSabbata & Sivaram [16] and the references given there. 

The Abelian axion has the following properties: 

• Pseudoscalar field, i.e., spin = 0, parity = —1. 

• Couples to Maxwell’s field in the Lagrangian according to aFAF = 2a EA 
BAdt, see (45) and (43). Here E is the 3-dimensional electric field 1-form and 
B the corresponding magnetic field 2-form. This term in the total Lagrangian 
can be written as 

aFAE = — daAAAF, (47) 

dropping, as usual, the total derivative. This contributes to the excitation 
H = —dL/dE a term 

^ da A A. (48) 

• Since it arises on the same level as the metric, see Eq.(64) below, it is a field 
of a similar universality as the gravitational field. 

As yet, the Abelian axion has not been found experimentally, see the discus- 
sion of Cooper & Stedman [12] on corresponding ring laser experiments. 

6.4 Isotropic 

The linearity ansatz (45) can be further constrained in order to arrive eventually 
at an isotropic constitutive tensor. We will proceed here somewhat unconven- 
tional in that we don’t assume a metric of spacetime beforehand but rather 
derive it in the following way: 



Duality Operator, Electric and Magnetic Reciprocity. The constitutive 
tensor X of (45) defines a new duality operator which acts on 2-forms on 
X. In components, an arbitrary 2-form 0 = \0ij dx^ A dx^ is mapped into the 
2-form by 

(49) 

see [67,33]. No metric is involved in this process. Now the linear material law 
(45) can be written as 

H={f* + a)F. (50) 

We postulate that the duality operator, applied twice, should, up to a sign, 
lead back to the identity. Such a closure relation or the “electric and magnetic 

O 

reciprocity” [85] reduces the number of independent components of X to 9 (with- 
out using a metric). One can demonstrate that this is a sufficient condition 
for the nonexistence of birefringence in vacuum, see [62,63,46,64,28]. Then the 
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fourth-order general Fresnel equation degenerates to the second-order light cone 
equation. Therefore, we impose 

** = -!. (51) 

The minus sign yields Minkowskian signature®, whereas the condition = 4-1 
would lead to Euclidean or to the mixed signature (-I-, -b, — , — ). 

Seemingly Toupin [85] and Schonberg [77] were the first to deduce the confor- 
mally invariant part of a spacetime metric from duality operators and relations 
like (49) and (51). This was later rediscovered by Jadczyk [40], whereas Wang 
[90] gave a revised presentation of Toupin’s results. A forerunner was Peres [69] , 
see in this context also the more recent papers by Piron and Moore [70]. Brans 
[7] and subsequently numerous authors discussed such structures in the frame- 
work of general relativity theory, see, e.g., [9,37,26,68] and the references given 
there. 

It is convenient to adopt a more compact bivector notation by defining the 
indices = (01,02,03,23,31,12). Then ybfc* becomes the 6x6 matrix 

X and (51) goes over into 

X ejK X clm = —5m ■ (52) 



In terms of 3 x 3-constituents an arbitrary symmetric X =X constitutive 
matrix reads 



X Ji = 



A C 
C'^ B 



e" = e" 




(53) 



where A = A^ , B = B^ , and the superscript ^ denotes transposition. The 

O T j 

algebraic condition ejj X = 0 is provided by tr C = 0. 

The general non-trivial solution of the closure relation (52) can be written in 
the form 



O 

X 



IJ 



fpB-^+qN B~^K\ 
-KB~^ B J ■ 



(54) 



Here H is a nondegenerate arbitrary symmetric 3x3 matrix (6 independent com- 
ponents Bab), K an arbitrary antisymmetric matrix (3 independent components 
Kab =■ iabck‘^), N the symmetric matrix with components := k°'k^, and 
q := — 1/det B, p := [tr{NB )/ det B] — 1. Thus, Eq.(54) subsumes 9 independent 
components. 



Triplet of Self-Dual 2-Forms. The duality operator induces a decom- 
position of the 6-dimensional space of 2-forms into two 3-dimensional invari- 
ant subspaces corresponding to the eigenvalues ±t. Writing the 2-form basis 

® One could define a different duality operator by = jCijki X*^**"" 0mn such that 

## ^ _J2^ 
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0^ = dx* A dx^ in terms of the two 3-dimensional column vectors 



0^ 




a,b,--- = 1,2,3, 



(55) 



one can construct the corresponding self-dual basis 0^ := ^{0^ — i "^0^). In the 
3-vector representation, 



(s) 

0 



I 



(s) 

/3“ 



(s) 

^ 6 / 



(56) 



one of the 3-dimensional invariant subspaces can be spanned either by the up- 
per or by the lower components which are related to each other by a linear 

(s) (s) 

transformation. For example, f3 can be expressed in terms of 7 according to 

(s) (s) (s) 

(3 = {i + B~^K)B~^ 7 . Therefore 7 or, equivalently, the triplet of 2-forms 



:= -(^-1)“*' 7h 



= - (dx^ A dx^ — (det B) ^ kb dx^ A dx°' 
i {B~^)°'^ ebcd dx'^ A dx'^). 



(57) 



subsume the properties of this invariant subspace. Each of the 2-forms carry 3 
independent components, i.e., they add up to 9 components. 

o 

The information of the constitutive matrix X is now encoded into the 
triplet of 2-forms . One can verify that the latter satisfies the completeness 
relation 

S0) A ^ (5-1)“^- {B)^d A . (58) 



Extracting the Metric. Within the context of SU (2) Yang-Mills theory, Ur- 
bantke [88] was able to derive a 4-dimensional spacetime metric Qij from a triplet 
of 2-forms satisfying a completeness condition of the type (58). Explicitly, the 
Urbantke formulas read 

^ V^eabc , (59) 

^ . (60) 

The 5'-“^ are the components of the 2-form triplet 5'^“^ = S^j'^dx^ A dx^/2. If we 
substitute the forms (57) into (59) and (60), we can display the metric explicitly 
in terms of the constitutive coefficients: 



9tj = 



1 1 


f detB 


ka 


\/det B 


^ -kb 


-Bab + (det B)-^ ka kb 



(61) 
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Here ka '■= Babk^ = Bab One can verify that the metric in (61) has 

Minkowskian signature. Since the triplet S'O) is defined up to an arbitrary scalar 
factor, we obtain a conformal class of metrics. 

Given a metric, we can now define eventually the notion of local isotropy. 
Let be the contravariant coordinate components of a tensor field and 

rpai...ap . .v j^g frame components with respect to an or- 

thonormal frame Cq. = e*Q di. A tensor is said to be locally isotropic at a given 
point, if its frame components are invariant under a Lorentz rotation of the 
orthonormal frame. Similar considerations extend to tensor densities. 

There are only two geometrical objects which are numerically invariant under 
local Lorentz transformations: the Minkowski metric Oq,^ = diag(-|-l, —1, —1, —1) 
and the Levi-Civita tensor density CajS-yS- Thus 

= <f{x) + ip{x) (62) 

is the most general locally isotropic contravariant fourth rank tensor density of 
weight -1-1 with the symmetries _ _q-ijik _ j-khj ^ Here (j) and (p 

are scalar and pseudo-scalar fields, respectively. 

One can prove that the constitutive tensor in (45) with the closure property 
(51) is locally isotropic with respect to the metric (61), see also [63]. Accordingly, 
for the constitutive tensor, we finally have 

= (63) 

Thus, the isotropic law reads 

^ Fki + a F,, (64) 

or, if written with the help of the Hodge star operator belonging to the metric 

(61), 



H={f* + a)F. 



(65) 



6.5 Centrosymmetric 

If we want the constitutive tensor to be reflection symmetric at each point of 
spacetime, i.e., if we require centrosymmetry, then we have to kill the Abelian 
axion and arrive, provided F is chosen in accordance with the Sl-conventions, 
at the usual law for Maxwell-Lorentz vacuum electrodynamics® 

H=f*F=.[^*F. (66) 

V 1^0 

® Remember that in Ricci calculus the excitation is defined according to T-C^Ric = 
Then see (ll)i. 
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7 Non-minimal Coupling Involving Curvature, 
Nonmetricity and Torsion? 

7.1 Non-minimal Coupling Violating Charge 
and/or Flux Conservation 

The Maxwell equation dH = J reflects (and comes from) the electric current 
conservation, dJ = 0, see Sec. 4. By modifying the left hand side of dH = J, 
one can arrive at a model violating charge conservation. Such a modification can 
typically originate from a non-minimal coupling of the electromagnetic to the 
gravitational held. Given the torsion 2-form T“, one can consider, for example, 
the held equation 

dH + a (e„jT“) AiJ-h/3*(r^a AT“) Ail = J, (67) 

or, with the nonmetricity 1-form Qaf 3 ■= —Dgaff and the Weyl 1-form Q := 
Qc^/4., 

dH + jQ AH + S*{d°‘ AQc,0) AH = J . (68) 

Similar non-minimal terms could emerge in dF = 0. However, curvature depen- 
dent terms cannot be accommodated at the level of the Maxwell equations, since 
the contraction of the indices produces always a form of even rank whereas the 
Maxwell equations are represented by 3-forms, i.e., by forms of odd rank. In any 
case, violating charge or flux conservation is not possible without giving up most 
of the experimentally established structure of the theory of electromagnetism. 
Therefore we will not follow this path. 

Incidentally, there are some papers in the literature in which the conventional 
vacuum constitutive law H ~ *F is uphold, but the Maxwell equations are 
coupled to torsion in an inconsistent way. A closer inspection of the papers 
[75,15,82] shows that the proposed “non-minimal” coupling of torsion to the 
electromagnetic held is void of physical contents. In fact, torsion drops out if the 
algebra is done correctly. 

Another procedure comes to mind if we talk about the violation of the conser- 
vation laws. Hojman et al. [36] introduced a new scalar held the tlaplon, see 
also Mukku & Sayed [57]. The gradient dip of the tlaplon was put proportional 
to the trace part := A T of the torsion T“, see [31]; here T := eplT^ . 

In fact, we have T = ^ dp. 

Superficially, the axial scalar a{x) (Abelian axion) and the scalar tlaplon p{x) 
may look similar. However, the axion already emerges from spacetime viewed 
as a differential manifold as soon as a linear constitutive law is assumed for 
electromagnetism, whereas the tlaplon can only be introduced if the differential 
manifold in equipped with a linear connection. In other words, the axion is a 
pre-metric and a pre-connection animal, the tlaplon, in contrast, needs to be 
‘housed’ in a linear connection. 

Moreover, as we saw, the axion respects the conservation laws (and pleases 
us thereby), whereas the tlaplon defies these rules and appears as an anti- 
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electromagnetic creature. The electromagnetic field is not defined in the conven- 
tional way, namely hy F = dA. Instead, Hojman et al. define it via a “covariant” 
derivative according to 

F = F-T°‘Aa = F+\rAA = F+ l^dipAA. (69) 

Thus the “electromagnetic” Lagrangian becomes 

FA*F = FA*F + d(pAAA*F+^TAAA*{TAA). (70) 

The last term does not contribute to the excitation F[ = —dL/dF, but the 
second term produces a contribution 

^*{dipAA). (71) 

We can compare now the two contributions from the axion (48) and the 
tlaplon (71). They are reminiscent of each other since one is equal to the Hodge 
dual of the other. Therefore, in the tlaplon case, besides a connection, we need 
additionally a metric. But the most decisive difference is, as can be read off from 
(70), that the Maxwell equations get amended and axiom 1 and axiom 3 are no 
longer valid. 

7.2 “Admissible” Non-minimal Coupling 

The message is then that a change of the Maxwell equations dH = J , dF = 0 
is to be avoided, unless one allows for a violation of electric charge or magnetic 
flux conservation. By introducing the metric into the constitutive law, one 
gets a smooth and natural transition from special relativity to general relativity 
and to gauge theories of gravity. In the constitutive law for vacuum one could 
imagine, along with the contributions depending only on the metric, couplings 
like [23] 

^ ^ ijkl 

+ A 4 + A 5 E + Ag E (72) 

without violating the conservation laws. Here E := e“je/ 3 ji?a^ is the curvature 
scalar, and we denote the right or Lie dual of an so( 1, 3)-valued form ipag by 
C/3 := 

Choose, for example, = 2^/—gfo(l + E) then the inhomoge- 

neous Maxwell equation would read 

d*F + /^^d(E*F) = ^J, (73) 

Jo 

i.e., a coupling of curvature and electromagnetic field strength would be possible. 
However, one had to introduce a new natural constant with the dimension of 
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\(3] = 1/length. On the level of the Lagrangian, this coupling would be non- 
minimal, 

Ko„-m = - y (1 + P'^R) *FAF, (74) 

but such an ansatz would not spoil the fundamental principles of electrodynam- 
ics; it would seem to be the most natural way of achieving an i?i^-coupling. 
Goenner [23], see also the literature given there, derived this non-minimal La- 
grangian from some fundamental principles, like the existence of a decent New- 
tonian limit. However, in his view, such a model violates charge conservation. 
We disagree with him on this point. 

We want to stress that one cannot achieve a similar non-minimal coupling to 
the torsion T“ of spacetime. First of all, the Maxwell equations are independent 
of torsion. By means of the constitutive law one maps the 2-form F to the 2- 
form FI. The curvature Ra^ is a 2-form of type (1, 1), i.e., it carries two GL(4, R) 
indices, whereas the torsion 2-form carries only one index. Therefore, by con- 
traction, we cannot get a scalar out of the torsion. A coupling like (ea\T°‘) A *F 
is not possible, since this is a 3-form. However, higher powers in T“ would be 
possible such as 

= /o [1 + 7 "*(T“AT„)] A*F (75) 

or 

H = fo[l + S^ * ((e„J T“) A (e^J T^) A (e^J T'^) A (e^J T^))] A *F . (76) 

Here [7] = [5] = 1/ length. Accordingly, it is not too difficult to introduce a 
coupling of torsion to electromagnetism. However, the price one has to pay is the 
introduction of new natural constants 7 and <5. In other words, even if possible, 
we don’t take such models too seriously. 

Also non- minimally coupled nonmetricity could be installed by additional 
quadratic pieces such as 

H = h[l + C'(e„JQ“^)(e^jg^)] *F . (77) 

Therefore, there are quite a number of different “admissible” options available 
as soon as we allow non-minimal couplings to arise. 

8 Outlook 

Using astronomical observations on the propagation of light, the upper bounds 
for non-minimal coupling effects should be determined in a systematic way, as 
is done, for example, by Haugan and Lammerzahl [28]. For such a purpose, we 
will develop [66] the geometrical optics limit of the Maxwell equations dH = 
J, dF = 0 and will use particular constitutive laws, as, e.g., the linear law. 
Possible couplings to curvature, torsion, and nonmetricity should come under 
sharper focus in this way. Non-linear effect a la Ayon-Beato & Garcia should be 
investigated in the context of, say, the metric-affine gauge theory of gravity. 
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